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ABSTRACT 


This  report  presents  the  results  of  a  nine-month  test  of  a  prototype 
SDI  system  developed  for  Army  Technical  Libraries.  Diming  the  pilot  test, 
one  thousand  documents  were  cataloged  and  indexed  and  disseminated  to 
twenty-five  scientific  and  technical  personnel  at  U.  S.  Army  Natick  Labora¬ 
tories.  During  the  course  of  the  pilot  test,  statistics  were  accumulated  on 
operating  costs  and  various  parameters  affecting  the  relevancy  of  dissem¬ 
inated  information.  The  prototype  SDI  system  utilizes  graphic  techniques 
for  producing  multiple  copies  of  each  document  abstract.  The  output  of  the 
computerized  matching  function  is  a  set  of  EAM  cards  which  are  subsequent¬ 
ly  used  to  collate  reproduced  abstracts  for  dissemination.  Collation  is  ac¬ 
complished  by  interspersed  gangpunching,  merging,  and  sorting  techniques. 
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FOREWORD 


Over  the  past  decade,  libraries  have  gradually  assumed  a  more 
active  role  in  making  recorded  inf  more  available  to  the  technical 

community.  Through  the  assimilation  oi  data  processing  techniques  pre¬ 
viously  relegated  to  scientific  computation  and  accounting,  the  library  has 
been  able  to  decentralize  access  to  its  collection  of  stored  information. 
Selective  dissemination  of  information  is  an  extension  of  this  trend  towards 
decentralization  of  information  access.  An  SDI  system  delivers,  to  the 
users  in  the  field,  a  representative  portion  of  a  document  selected  on  the 
basis  of  an  interest  profile  stated  by  the  user. 

During  a  first  phase  of  this  contract,  a  number  of  representative 
SDI  systems  was  studied  and  described  in  a  separate  report.  This  report 
is  available  from  Defense  Documentation  Center  as  AD  636916.  It  com¬ 
pares  the  various  implementation  methods  in  current  use. 

Using  the  results  of  the  study  conducted  in  Phase  I,  the  prototype 
SDI  system  was  developed  for  the  Army  Research  Community.  This  re¬ 
port  gives  the  operating  results  obtained  from  a  nine  month  pilot  test  of 
the  prototype  system.  The  pilot  test  demonstrated  the  utility  of  the  system 
in  starting  up  an  SDI  program  at  U.  S.  Army  Natick  Laboratories.  Other 
laboratories  may  utilize  the  system  as  it  exists  directly  or  may  choose  to 
modify  it  to  conform  with  on-going  automated  library  services. 

This  final  report  was  prepared  for  the  ATLIS  Program  under  the 
direction  of  Mr.  Robert  Martin,  Chief  Librarian,  Army  Natick  Laboratories, 
Natick,  Massachusetts.  All  work  was  performed  by  Information  Dynamics 
Corporation,  80  Main  Street,  Reading,  Massachusetts  under  contract  DAI  9- 
129-AMC-957(N). 
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I.  INTRODUCTION  •  PURPOSE  AND  SCOPE  OF  THE  PILOT  TEST 


The  overall  objectives  of  this  contract  were:  to  survey  a  representa¬ 
tive  sample  of  existing  computerized  or  semi-mechanized  systems  for  se¬ 
lective  dissemination  of  information  (SDI);  to  develop  and  describe  an  effec¬ 
tive  procedure  for  establishing  and  operating  a  model  SDI  system;  and  to 
perform  a  pilot  test  of  the  recommended  system  and  evaluate  its  suitability 
for  implementation  at  Army  Natick  Labs  (NLABS)  and  other  Army  research 
activities.  The  work  was  accomplished  in  two  distinct  phases,  separately 
funded  and  sequentially  conducted. 

Phase  I  of  the  project  encompassed  the  survey  of  existing  systems 
and  the  design  and  description  of  a  model  SDI  system  incorporating  the  best 
features  of  existing  systems  and  any  innovations  that  would  enhance  its  ef¬ 
ficiency  or  economy.  Operating  SDI  systems  were  studied  which  demon¬ 
strated  different  techniques  for  accomplishing  selective  dissemination  of 
information.  Operational  aspects  examined  were:  the  type  of  publications 
constituting  information  input;  the  basis  for  screening  and  selecting  input 
and  levels  of  personnel  performing  screening  or  selection;  the  method  of 
abstracting  and  indexing;  the  method  of  vocabulary  control;  the  types  of 
personnel  comprising  the  user  group;  the  number  of  users  and  their  subject 
interests;  the  method  of  deriving  a  users'  interest  profile;  the  strategy  for 
matching  interest  profiles  against  document  descriptions;  the  nature  of  the 
notification  sent  to  the  user;  the  type  of  feedback  collected;  statistical  con¬ 
trols  exerted;  and  statistics  reported  on  the  relevancy  of  notices,  distribu¬ 
tion  of  notices,  and  operating  costs.  At  the  conclusion  of  Phase  I,  the  re¬ 
sults  of  the  systems  survey  and  the  description  of  a  model  system  were  in¬ 
cluded  in  the  final  report,  "Selective  Dissemination  of  Information:  Review 
of  Selected  Systems  and  a  Design  for  Army  Technical  Libraries"  (AD  636016) 
August,  1066. 

The  objective  of  Phase  II,  results  of  which  are  here  reported,  was 
to  pilot  test  the  system  designed  and  recommended  in  Phase  I.  The  pilot 
test  was  conducted  with  twenty-five  professional  personnel  at  Army  Natick 
Laboratories  (NLABS)  and  included  the  processing  of  1000  technical  infor¬ 
mation  units  selected  from  serial  and  monographic  publications  or  abstracts. 

Several  sub-objectives  of  Phase  II  were  implicitly  stated  in 
the  Phase  I  final  report  and  were,  by  contract,  adopted  as  objectives  of 
Phase  II.  One  of  these  objectives  was  to  test  the  feasibility  of  implement¬ 
ing  an  SDI  system  which  started  at  time  "zero"  with  no  controlled  vocabulary 
and  built  u  controlled  vocabulary  as  a  by-product  of  operation.  Another  very 
important  objective  was  to  evaluate  as  accurately  as  possible  all  costs  as¬ 
sociated  with  the  operation  of  the  pilot  system.  These  costs  were  to  include 
labor,  materials,  special  equipment,  machine  utilization,  and  space  require¬ 
ments  but  wore  lo  exclude  the  acquisition  costs  of  information  and  the  cost 
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of  supplying  full-size  documents  or  microreproductions  requested  as  the 
result  of  SDI  operations. 


The  methods  utilized  in  the  pilot  test  were  intended  to  be  capable  of 
extrapolation  to  other  Army  Research  Activities,  and  the  system  was  to  be 
described  in  sufficient  detail  for  direct  implementation  without  outside  con¬ 
sultation.  Procedures  for  implementation  are  described  in  Volume  II  of 
this  final  report. 


II.  THE  USER  POPULATION  AT  NLA  US 


A.  INTEREST  GROUPS  REPRESENTED  AT  NLABS 


The  most  characteristic  attributes  of  Army  Natick  Laboratories  are 
its  extreme  diversity  of  activities  and  the  great  depth  of  detail  represented 
by  each  activity.  Probably  there  is  no  comparable  assemblage  of  this  vari¬ 
ety  of  talents  at  any  other  military  installation.  An  equivalent  range  of  in¬ 
terests  could  only  be  expected  to  exist  at  some  of  the  largest  universities, 
such  as  M.  I.  T.  or  the  California  Institute  of  Technology.  The  findings  of 
this  study  indicate  that  the  variety  of  activities  at  NLABS  imposes  more 
stringent  requirements  on  the  design  and  pilot  test  of  an  SDI  sj3tem  than 
would  be  encountered  at  any  other  Army  installation. 

1.  The  NLABS  Mission 


In  brief,  the  NLABS  mission  is  to  support  personnel  in  a  hos¬ 
tile  environment.  Support  functions  include  provisioning  and  protection 
against  the  elements,  the  implements  of  warfare,  and  attack  by  animal  and 
vegetable  predators.  They  also  include  general  supply  --  food,  fuel,  cloth¬ 
ing,  personal  equipage,  shelter,  and  weapons.  Environmental  conditions 
range  from  those  encountered  below  the  surface  of  the  sea  to  outer  space 
and  in  latitude  from  the  poles  to  the  equator.  Activities  range  from  basic 
research  in  improving  the  properties  of  materials  to  the  styling  and  tailor¬ 
ing  of  military  uniforms. 

2.  Organizational  Structure 

Army  Natick  Laboratories  presently  conducts  research  in  the 
following  seven  divisions: 


Air  Drop  Engineering 

Clothing  and  Organic  Materials 

Containers 

Earth  Sciences 

Foods 

Mechanical  Engineering 
Pioneering  Research 
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These  are  supported  by  a  technical  services  division  and  administered  by 
a  Commanding  General,  Deputy  Commander,  Scientific  Director,  and  sup¬ 
porting  functions  associated  with  base  operation. 

The  Air  Drop  Engineering  Division  develops  systems  for  in¬ 
flight  delivery  of  personnel,  equipment,  and  supplies.  The  division  studies 
materials  suitable  for  container  fabrication,  parachutes,  harness  straps, 
and  packaging,  and  it  designs  and  tests  parachutes  and  other  air  delivery 
devices. 


The  Earth  Sciences  Division  measures  and  describes  world¬ 
wide  climatic  conditions  that  would  affect  the  requirements  for  human  sup¬ 
port.  Geographers,  meteorologists,  and  cartographers  develop  statistics, 
maps,  and  guides  covering  all  types  of  environment  from  jungle  to  mountain 
and  desert  to  arctic  regions. 

The  Pood  Division  conducts  basic  and  applied  research  in 
the  development  of  stable  rations  and  methods  for  producing  food  and  (Stable 
water  where  these  are  lacking  in  the  environment,  food  technologists  are 
concerned  with  the  affects  of  temperature,  humidity,  radiation,  and  biolog¬ 
ical  and  chemical  contamination  on  the  storage  properties,  flavor,  appear¬ 
ance,  and  consistency  of  foods  and  beverages. 

The  Clothing  and  Organic  Materials  Division  designs  a  full 
range  of  clothing  for  both  work  and  dress,  and  it  studies  factors  affecting 
comfort,  serviceability,  wear,  launderability,  appearance,  heat  dissipation, 
and  freedom  of  movement.  In  addition,  the  division  studies  systems  of 
materials  which  can  be  combined  for  protection  against  projectiles,  anti¬ 
personnel  mines,  and  human  traps. 

The  Pioneering  Research  Division  conducts  basic  research 
in  materials  development,  and  it  studies  the  effects  of  radiation,  biological 
and  chemical  attack,  and  other  environmental  factors  on  various  natural 
and  (  /nthetic  materials 

The  Container  Division  develops  new  juckaging  materials 
and  methods  for  forming  and  handling  packages.  Packaging  must  be  de¬ 
veloped  which  meets  current  military  specifications  and  special  additional 
requirements  imjKJsed  by  the  environment  and  the  circumstances  under 
which  the  (Kickagc  is  to  be  used.  In  addition,  the  Container  Division  devel¬ 
ops  s|M*cifications  for  new  types  of  packaging. 
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The  Mechanical  Engineering  Division  investigates  the  prop¬ 
erties  of  materials  and  supports  other  divisions  in  developing  materials  for 
specialized  purposes. 

Throughout  all  divisions  there  is  a  stroi  „  interface  with  elec¬ 
tronics  l)ccause  of  the  dej>endency  on  modern  instrumentation  and  because 
of  the  interface  ijetween  man  and  machine  which  affects  the  design  of  per¬ 
sonnel  equipment. 

'.i.  Pure  Research 

Pure  research  is  conducted  primarily  in  the  development  and 
characterization  of  new  materials  and  products  for  human  consumption  and 
protection.  Research  is  under  way  in  the  development  of  new  methods  for 
producing  foods  and  new  materials  for  protecting  the  physical  integrity  of 
the  human  body  and  for  maintaining  its  chemical  and  thermal  equilibrium. 
Basic  research  is  constantly  conducted  on  all  environmental  factors  which 
affect  both  the  human  and  materials  which  he  uses.  In  the  social  sciences, 
psychologists  are  at  work  to  determine  the  factors  which  control  the  effi¬ 
ciency  and  com  fort  of  man  and  the  various  psychological  factors  that  influ¬ 
ence  the  utility  and  effectiveness  of  personnel  equipment. 

These  representative  research  projects  at  NI,AI>S  indicate 
the  variety  of  problems  approached.  A  complete  listing  would  serve  no  ad¬ 
ditional  pur|>oHc. 

4.  Applied  Research 

Professional  personnel  at  NLABS  are  primarily  charged  with 
the  responsibility  to  develop  reliable  equipment  and  supplies  which  meet  the 
requirements  of  a  particular  mission  or  battlefield  condition.  Almost  all 
development  activities  ultimately  lead  to  a  usable  end  item  such  as  a  uni¬ 
form,  tent,  1*01,  storage  facility,  air  drop  delivery  system,  or  field  ration. 
Much  of  the  upplied  research  is  therefore  pointed  towards  the  testing  and 
evaluation  of  materials  as  they  appear  on  the  market  or  as  the>  are  devel¬ 
op'd  in  the  research  lalior.itory.  Extensive  testing  is  p'rformed  on  various 
plasties,  synthetic  filters,  fabrics,  leather,  protective  face  creams,  fiher- 
re  in  forced  matoriuls.  elastomers,  and  other  products  going  into  the  fab¬ 
rication  of  an  end  item. 


Complete  clothing  and  personnel  equipment  systems  are  al¬ 
so  designed,  fabricated,  and  tested  at  NLABS  facilities.  Complete  cloth¬ 
ing  outfits,  bedding,  tentage,  and  field  packs  are  tested  extensively  under 
simulated  environmental  conditions  to  evaluate  them  in  terms  of  service¬ 
ability  and  comfort. 

5.  Research  and  Development  Facilities 

The  main  research  buildings  at  NLABS  consist  of  climatic 
chambers,  a  complete  shop,  a  radiation  laboratory,  subsistence  evaluation 
laboratory,  and  individual  laboratory  offices.  A  gigantic  solar  furnace  is 
used  to  test  the  high-temperature  properties  of  materials.  Each  office 
laboratory  is  equipped  to  accomplish  the  objectives  of  individual  projects. 
There  is  an  abundance  of  all  types  of  chemical  analytical  equipment  and 
materials  testing  equipment.  Equipment  is  also  available  for  forming  and 
fabricating  various  materials  on  a  laboratory  and  semi -product  ion  basis. 

6.  End  Products  and  Services 

Although  NLABS  does  not  manufacture  an  end  item  on  a  pro¬ 
duction  basis,  it  does  provide  extensive  models  and  feasibility  studies  in 
anticipation  of  full-scale  production.  Primarily,  NLABS  is  an  experimen¬ 
tal  design  facility  which  turns  out  one-of-a-kind  items  that  are  subsequently 
mass  produced  by  contractors.  In  many  instances,  NLABS  collaborates 
with  other  military  and  non-military  Government  research  activities  which 
require  utilization  of  the  specialized  knowledge,  skills,  and  equipment 
available  at  the  NLABS  campus. 

7.  Summary  of  Interests 

The  above  project  facilities  and  organizational  descriptions 
are  given  to  acquaint  the  reader  with  the  extreme  variety  of  interests  to  be 
served  by  an  SDI  system  at  Army  Natick  Laboratories.  Some  of  the  re¬ 
search  fields  encompassed  by  N|*AltS  personnel  are  m.illicmntii  s,  physics, 
chemistry,  meteorology,  anthrt»|*>logy,  chemical,  elect rieal  and  mcchan- 
ical  engineering,  biochemistry  and  Iwcteriology,  psychology,  nutrition, 
gengru|>hy,  inieology,  geophysics,  entymology,  jurisiiolegy,  radiology, 
process  control  quality  assurance,  and  metallurgy.  Of  approximately  600 
professional  personnel,  thirty  percent  hold  a  Master's  or  Doctoral  degree. 
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B. 


SELECTION  OF  PARTICIPANTS  IN  THE  PILOT  TEST 


That  all  professional  personnel  at  NLABS  could  not  be  included  in 
a  pilot  test  was  immediately  obvious.  This  observation  necessitated  se¬ 
lecting  a  sample  of  participants  that  would  not  impair  the  meaningfulness 
of  the  pilot  program  but  would  represent  the  diversified  user  population  at 
NIJVBS. 


1 .  Objectives  of  Selection 

The  primary  objective  of  selection  was  to  limit  the  number 
of  participants  to  a  manageable  group  which  preserved  a  depth  and  breadth 
of  subject  interests  characteristic  of  the  entire  NLABS  population.  Individ¬ 
uals  who  represented  a  wide  number  of  subject  interests  were  thus  selected. 
Some  had  an  in-depth  interest  in  a  particular  subject,  and  others  had  a 
shallower  interest  in  a  wide  number  of  topics.  An  additional  requirement 
was  that  lines  of  communication  be  established  between  the  project  and  the 
participants  so  that  adequate  feedback  could  be  obtained  on  the  effectiveness 
of  dissemination  achieved  by  each  SOI  cycle. 

It  was  ultimately  determined  that  the  above  requirements 
could  be  met  by  a  sample  population  containing  twenty-five  participants.  As 
a  result  of  an  analysis  of  the  activity  of  the  professional  personnel,  the 
Clothing  and  Organic  Materials  Division  was  selected  as  a  pilot  test  bed. 
Within  this  division  activities  ranged  from  |mn.'  research  lit  finished  product 
development,  and  a  broad  range  of  subject  interests  were  represented. 

2.  Range  of  Subject  Interests 

The  range  of  subject  interests  represented  by  the  twenty- 
five  participants  in  the  pilot  test  is  shown  in  Table  I. 

3.  Information  Acquisition  Habits  of  Participants 

The  selection  of  participants  in  the  pilot  test  was  balanced 
with  respect  to  current  information  acquisition  habits.  Some  participants 
were  extremely  heavy  users  of  the  technical  library,  whereas  others  used 
the  library  very  infrequently.  Some  of  the  participants  read  extensively 
after  working  hours,  whereas  others  utilised  recorded  literature  very  little. 
Some  had  comprcht  »»»ivr  literature  collections  at  their  own  desks,  whereas 
others  had  only  a  few-  implications  on  hand. 


TABLE  I 

SUBJECT  INTERESTS  OF  PILOT  PARTICIPANTS 


Acetylenic  organometallic  compounds 

Blast  and  shock  wave  phenomena 

Clothing  manufacturing  processes 

Color  measurement 

Criteria  for  human  comfort 

Cryogenic  life  support  systems 

Decomposition  of  sulfonium  salts 

Deterioration  of  synthetic  fibers 

Effect  of  fabric  geometry  on  physical 
properties 

Foam -in -place  plastics  and  low -tempera¬ 
ture  foaming  systems 

Fluid  mechanics 

Fuels 

Gel  permeation  chromatography 
Graft  copolymers 

Heat  and  mass  transport  through  films 
and  fabrics 

High-impact  performance  of  fibers  and 
fiber  assemblies 

High-temperature  properties  of  materials 

High  vacuum  equipment  ami  techniques 

Impact  behavior  and  abrasion  resistance 
of  textiles 

Ionic  polymerisation 


Life  support  systems  for  terrestrial, 
space,  and  underwater  applications 

Metal  cleaning  and  descaling  compounds 

Metal  finishing 

Military  uniforms 

Molecular  structure  and  molecular 
spectroscopy  of  polymers 

Noise  and  impact  energy  attenuation 

Permeability  of  fabric  systems 

Physical  and  chemical  properties  of 
polymers 

Polymer  rheology  and  molecular  structure 
Protective  clothing 
Protective  finishes  for  textiles 
Reproduction  supplies 
Soaps  and  detergents 

Synthesis  of  monomers  for  polycarbonates 
Synthetic  fibers 

Techniques  for  testing  and  i  xamining 
fabrics 

Textile  dyeing  and  finishing 
Textiles  for  clothing 

Thermal  and  nuclear  radiation  protection 
Water  repellency  of  fabrics 
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Information  attitudes  of  the  users  were  also  somewhat  bal¬ 
anced.  A  few  users  were  extremely  discriminating  in  the  type  of  informa¬ 
tion  that  they  would  read,  but  a  few  were  satisfied  by  a  wide  variety  of  top¬ 
ics  at  varying  depths  of  detail.  On  the  average,  however,  the  reading  habits 
of  the  sample  (x)pulation  followed  the  norms  which  one  would  anticipate  in 
the  entire  Nl.ABS  |xjpulation.  Both  extremes  and  median  points  in  eachan- 
alytical  category  were  represented  in  the  sample  group. 

A  characteristic  of  the  group  that  was  thought  to  be  repre¬ 
sentative  of  Nl.ABS  |x?rsonnel  was  the  degree  of  reliance  on  the  various 
categories  of  literature.  The  user  group  relied  most  heavily  on  journal 
literature  and  least  on  Government  research  reports  obtained  through  sec¬ 
ondary  Government  publication  sources  such  as  CFST1,  NASA/STIP,  and 
DDC.  At  least  half  of  the  group,  however,  received  Government  research 
reports  on  direct  distribution  from  other  Governmental  laboratories  and 
contractors. 
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111 -  INFORMATION  resources  at  nlaus  technical  library 


A.  COLLECTION  COMPOSITION 

Current  conservative  estimates  for  NLABS  holdings  are  1000  peri¬ 
odical  titles,  15,  000  monographs,  and  75,  000  technical  reports.  Figures 
for  the  journal  and  technical  reports  collections  do  not  include  a  separate 
file  of  microfilm  procured  from  various  Government,  university,  and  com 
mercial  sources.  In  addition  to  the  literature  collection,  NLABS  has  an 
excellent  collection  of  maps  to  support  its  terrestrial  sciences  research. 

1.  Journals 


The  mixture  of  activities  in  both  basic  and  applied  research 
is  most  exemplified  by  the  NLABS  journal  collection.  At  least  one-fourth 
of  the  journals  received  regularly  are  trade  publications  which  describe  the 
latest  application  of  new  materials,  processes,  manufacturing  techniques, 
and  product  developments.  The  major  portion  of  the  journal  collection  con¬ 
tains  descriptions  of  the  latest  scientific  advances  in  materials  research, 
food  technology,  synthetic  fiber  production,  plastics,  rubber,  and  other 
technological  subjects.  Each  year  NLABS  publishes  a  list  of  the  technical 
papers  delivered  or  published  by  its  scientists.  Many  of  these  papers  ap¬ 
pear  in  the  very  journals  to  which  the  Library  subscribes.  As  a  matter  of 
interest,  the  abstract  of  one  of  the  these  papers  was  selected  by  the  SDI 
system  for  delivery  to  its  own  author,  who  was  one  of  the  twenty -five  par¬ 
ticipants  in  the  NLABS  SDI  pilot  program. 

The  major  fields  of  concentration  of  the  NLABS  journal  col¬ 
lection  arc  organic  and  inorganic  chemistry,  biological  and  medical  sciences, 
behavioral  and  social  sciences,  anJ  radiology,  and  to  a  lesser  degree,  the 
engineering  sciences. 

2.  Books 

The  15,000  volumes  of  i>ooks  are  stored  on  open  shelves  ac¬ 
cessible  to  the  library  users  as  are  the  current  journal  subscriptions  and 
25,000  bound  journal  volumes.  The  book  collection  is  concentrated  in  sub¬ 
ject  areas  similar  to  those  represented  in  the  journal  collection.  The  books 
are  completely  cataloged  by  the  Library  of  Congress  system,  and  each  book 
is  represented  by  cross-filed  card  sets  in  the  library  card  catalog  which  is 
open  to  the  users. 

3.  Technical  Reports 

The  75,  000  technical  reports  include  approximately  5,  000 
laboratory  notebooks  and  classified  reports.  At  the  present  time,  labora¬ 
tory  notebooks  are  being  cataloged  by  subject.  Technical  reports  authored 
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by  NLABS  scientists  are  completely  indexed  and  multiple  access  may  be 
gained  to  them  through  the  library  card  catalog.  Reports  received  auto¬ 
matically  from  other  Government  laboratories  are  not  completely  indexed, 
however.  Most  of  the  technical  re|)orts  which  the  library  orders  as  a  re¬ 
sult  of  a  user  request  are  allowed  to  remain  with  the  user. 

B.  COLLECTION  UTILIZATION 

Over  60%  of  the  personnel  who  would  have  occasion  to  utilize  the 
library  are  considered  regular  users.  The  majority  of  these  regular  users 
are  scientists  and  engineers,  not  administrators.  At  any  time  during  the 
working  day,  one  can  find  someone  busily  at  work  in  the  library.  Since  un¬ 
bound  journals  are  not  circulated,  a  high  proportion  of  on-site  utilization 
can  be  attributed  to  usage  of  the  journal  collection.  Fifty  percent  of  the  book 
collection  is  circulated  annually.  Requirements  for  return  of  a  borrowed 
item  are  not  stringent  unless  it  is  needed  by  another  user. 

The  technical  reports  collection,  75%  of  which  is  authored  by  NLABS 
scientists,  receives  moderate  use.  Only  2%  of  the  rcjxirts  are  circulated 
annually.  The  low  use  rate  of  technical  reports  can  be  attributed  to  an  ef¬ 
ficient  initial  distribution  system  and  also  to  the  availability  of  personal 
copies  from  such  agencies  as  Defense  Documentation  Center  and  National 
Aeronautics  and  Space  Administration.  Microfilm  plays  a  negligible  infor¬ 
mation  transfer  role  as  do  classified  technical  reports  in  hard  copy.  Special 
journals  and  articles  are  obtained  in  microform  when  high  cost  and  limited 
use  do  not  warrant  acquisition  of  full-size  copy. 

Users  are  permitted  to  remove  current  journal  issues  temporarily 
to  copy  selected  articles  on  off-site  office  copy  equipment.  These  copies 
form  a  nucleus  of  loose  materials  stored  by  many  scientists  within  their  own 
offices.  They  are  often  supplemented  by  copies  obtained  through  various 
university  copy  services.  Most  of  the  scientists  have  their  own  collections 
of  books,  some  of  which  arc  quite  extensive. 

C .  INFORMATION  PROJECTS 

The  library  periodically  publishes  an  accessions  list  which  announces 
books  and  technical  reports.  Sixteen -hundred  reference  requests  are  an¬ 
swered  annually,  some  of  which  occasion  the  use  of  the  Defense  Documenta¬ 
tion  Center  bibliographic  service.  Occasionally  journal  articles  or  techni¬ 
cal  reports  arc  translated  by  members  of  the  staff  or  translations  are 
sought  from  other  NLABS  scientists.  A  lille/author/ subject  card  catalog 
is  maintained  for  books,  periodicals,  and  technical  re|H>r1s  authored  by 
N  lJ\  BS  scientists. 
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Efficient  use  is  currently  being  made  of  most  of  the  NLABS  collec¬ 
tion.  The  area  which  could  have  the  greatest  increase  in  utilization  is  the 
technical  reports  collection.  Minimal  use  is  made  of  the  abstract  bulletins 
published  by  such  Government  agencies  as  DDC,  NASA,  and  AEG.  Although 
low  utilization  of  the  technical  reports  collection  can  be  attributed  to  pri¬ 
mary  distribution,  much  information  still  remains  hidden  within  these  an¬ 
nouncement  media  which  could  be  disseminated  to  advantage. 

Although  the  journal  collection  is  heavily  read,  many  of  the  partici¬ 
pants  complained  that  they  hud  insufficient  time  to  read  ail  of  the  journal 
literature  that  they  required.  Thus,  an  SDI  system  could  enhance  the  pres¬ 
ent  high  utilization  of  the  journal  literature  by  supplying  the  reader  with 
sufficient  information  for  him  to  select  the  most  pertinent  articles. 


IV.  SDI  PILOT  SYSTEM  DESCRIPTION 


A.  INTRODUCTION 


One  of  the  objectives  of  this  program,  as  stated  in  the  introduction, 
was  to  develop  a  model  system  that  contained  the  best  elements  of  success¬ 
ful  systems.  This  objective  was  accomplished  in  the  model  by  combining 
techniques  utilized  in  the  fields  of  digital  and  graphic  information  proces¬ 
sing,  in  addition  to  normal  SDI  techniques. 

A  novel  aspect  of  the  proposed  and  tested  search  strategy  in  this 
model  is  that  matching  user  interest  against  document  description  is  ac¬ 
complished  by  a  combination  of  weighted-term  and  Boolean  strategies. 

Each  method  can  be  used  separately,  or  in  combination,  without  program 
modification  or  change  in  operating  procedures.  This  facility  presents  a 
unique  opportunity  to  adjust  match  strategy  on  the  basis  of  individual  expe¬ 
rience. 


An  additional  feature,  graphic  reproduction  of  the  SDI  notice,  is 
novel  in  method,  but  not  in  scope.  Other  systems  have  produced  multiple 
copies  of  abstracts  on  an  office  duplicator  or  printing  press,  but  none  have 
mated  the  selective  collation  capability  of  EDP  equipment  with  the  speed 
and  accuracy  of  duplicating  equipment.  In  this  system,  information  is  se¬ 
lectively  collated  by  EDP  equipment,  but  reproduced  by  conventional  repro¬ 
duction  equipment.  This  feature  preserves  the  capability  to  reproduce 
whatever  the  eye  sees,  and  yet  retains  selective  collation  of  printed  ab¬ 
stracts  by  punched-eurd  sorting  techniques. 

B.  THE  IN  FORMATION/ INTEREST  MATCH  STRATEGY 

The  Match  Strategy  is  that  criteria  used  for  associating  an  informa¬ 
tion  unit  with  a  user  profile.  The  variables  in  this  two-valued  function  are 
an  abstraction  of  the  user's  interests  and  an  abstraction  of  the  contents  of 
a  unit  of  information  (a  book,  journal  article,  technical  report,  etc. ). 

Three  basic  match  strategies  were  isolated  during  Phase  I  of  this 
contract:  percentage -of -matching -ter ms  (See  A  in  Figure  1),  Boolean(See 
B  in  Figure  1),  and  weighted  (See  C  in  Figure  1).  The  first  of  these  made 
no  use  of  what  can  be  called  the  multidimensional  aspect  of  information  re¬ 
trieval  (i.  e. ,  each  dimension  is  represented  by  a  set  of  terms  describing 
a  different  facet  of  a  question:  retrieval  can  be  sharpened  by  recombining 
the  different  facets,  in  a  controlled  manner,  into  a  single  search  query). 
Since  the  second  and  third  strategies  did  employ  a  multidimensional  ap¬ 
proach,  they  showed  greater  promise  of  yielding  more  relevant  selections. 
The  comments  that  follow  reflect  the  latter  two  methods  plus  a  third,  a  com¬ 
bination  of  the  Boolean  and  the  weighted. 


A)  PERCENTAGE  -OF  -MATCHING  -TERMS  STRATEGY 


A* 

C 

M 


Profile 

Terms 


N 


O 

X 


p 

X 


Document 
|  Terms 


Y 


50%  of  profile  terms  match  document  terms. 


B)  BOOLEAN  STRATEGY 


Profile 

Terms 


Group  1 


Group  2 


At  least  one  term 
in  each  profile 
group  matches  a 
document  term. 
Boolean  statement  is 
(D  or  M)and  (O  or  X). 


C)  WEIGHTED  STRATEGY 


Profile  terms  A,  B, 
C,  and  D  match 
document  terms 
AND  sum  of  weights 
equals  or  exceeds 
threshold. 


Figure  1  Information/Interest  Match  Strategies 
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1.  The  Boolean  Match  Strategy 

In  the  Boolean  Match  Strategy,  the  user's  interest  is  repre¬ 
sented  by  one  or  more  sets  of  descriptive  terms,  but  the  information  con¬ 
tent  of  a  document  is  represented  by  only  one  set  of  terms.  A  selection 
results  if  the  set  of  document  terms  intersects  with  every  set  of  terms  de¬ 
scribing  the  user's  interest  (See  B  in  Figure  1).  The  user's  profile  thus 
takes  on  the  appearance  of  a  truth  statement  such  as,  "if  (a  or  b  or  c)  and 
(d  or  e),  then  select,"  where  a,  b,  and  c  constitute  one  term  set  and  d  and 
e  constitute  a  second  term  set,  each  describing  a  different  facet  of  the  same 
user  interest. 


To  implement  this  search  strategy  in  this  pilot  test,  each 
set  of  terms  describing  a  facet  of  a  user's  interest  was  assigned  a  unique 
number  (See  Figure  2).  The  numbers  were  used  merely  as  labels  for  the 
different  sets  and  did  not  indicate  degree  of  importance  or  order  of  pref¬ 
erence.  The  terms  in  each  profile  are  ordered  by  set  number  and  com¬ 
pared  serially  by  computer  against  the  terms  describing  each  document. 

Each  time  a  user  term  matched  a  document  term,  the  extent  was  marked 
by  setting  a  computer  switch.  When  a  set -number  change  was  detected, 
the  state  of  the  switch  was  tested.  If  the  switch  was  off,  the  document  was 
rejected.  If  the  switch  had  been  on,  then  comparison  continued  with  the 
next  group  of  terms  in  the  interest  profile.  If  the  switch  was  on  for  the 
last  set  of  terms,  then  the  document  was  selected. 

2.  Weighted  Term  Match  Strategy 

In  this  strategy,  each  term  in  the  user  profile  is  assigned  a 
number  whot;e  value  is  chosen  by  the  user.  The  numbers  (called  weights) 
may  be  chosen  by  ranking  the  terms  in  order  of  importance.  The  weights 
associated  with  profile  terms  which  match  document  terms  are  summed 
together.  If  the  sum  of  the  weights  exceeds  a  threshold,  also  stated  by  the 
user,  the  document  is  selected  (See  C  in  Figure  1). 

In  the  original  concept,  the  value  of  the  weight  was  to  be  pro¬ 
portional  to  the  strength  of  the  user's  interest,  in  several  systems  current¬ 
ly  operating,  weights  can  range  from  1  through  9.  Since  the  total  number 
of  terms  in  any  given  profile  and  the  pattern  of  weights  are  unique  for  each 
person,  every  profile  may  have  a  different  threshold.  All  profiles,  however, 
can  be  normalized  to  a  threshold  of  one  by  dividing  the  user's  stated  thresh¬ 
old  into  each  of  the  weights  assigned  by  him  (See  Figure  2).  For  example, 
if  a  user  assigns  a  weight  of  9  to  one  term  and  a  weight  of  1  to  all  other 
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IDC  MATCH  FUNCTION 


Profile  Term 


Normalized 
Weight  Weight 


SET  NO.  1 


Geometry 

Weaves 

Fabric  Construction 

SET  NO.  2 

Bullet  Proof  Fabrics 

Denims 

Broadcloth 

Duck 

Jersey 

Knit  Fabrics 

Mesh  Fabrics 

SET  NO.  3 


Abrasion  Resistance 
Flexibility 
Slip  Resistance 
Stiffness 


2  2/15  =  . 133 

4  4/15  -  . 266 

7  7/15  =  .467 

1  1/15  =  .067 

5  5/15  =  . 333 

7  7/15  = . 467 

4  4/15  = . 266 

7  7/15  = .467 

5  5/15  =  .  333 

5  5/  15  -.333 

4  4/15  =  .266 

6  6/15  =  . 400 

2  2/15  -.133 

6  6/15  a  .  400 


THRESHOLD  --  15 

NORMALIZED  THRESHOLD  =  15/15  =  1.000 


Figure  2  Pilot  Test  Profile 


terms  in  his  profile,  and  requests  a  threshold  of  10,  then  a  normalized 
profile  would  show  a  weight  of  .  9  for  one  term  and  .  1  for  the  remaining 
terms.  By  normalizing  to  a  unit  threshold,  the  threshold  value  of  1  can 
be  automatically  supplied  by  the  computer  program  rather  than  by  the  pro¬ 
file. 


In  the  pilot  test,  the  normalization  to  a  unit  threshold  was 
adopted.  Weights  as  stored  in  machine -readable  data  ranged  from  0  to 
.  999.  These  weights  were  summed  algebraically  and  compared  against  a 
threshold  of  .  999.  If  the  sum  of  the  weights  equaled  or  exceeded  this 
threshold,  the  document  was  selected. 

3.  Combined  Boolean  and  Weighted  Term  Match  Strategies 

In  the  pilot  test,  the  Boolean  and  Weighted  Match  Strategies 
could  be  combined,  at  will,  to  increase  the  discriminating  capacity  of  se¬ 
lection.  Actually,  both  weighted  and  Boolean  aspects  were  present  in  all 
profiles.  But  the  assignment  of  weights  and  the  choice  of  sets  of  terms 
determined  whether  one  or  the  other  or  both  methods  were  in  effect.  For 
instance,  a  profile  containing  only  one  set  of  terms  acted  as  if  it  were  a 
straight  weighted  profile,  since  a  selection  was  limited  only  by  the  sura- 
of-the-weights  criterion  (See  A  in  Figure  3).  Conversely,  a  profile  con¬ 
sisting  of  several  sets  of  terms,  each  having  a  weight  of  .  999  would  act 
as  a  straight  Boolean  strategy,  since  selection  was  limited  only  by  the 
Boolean  criterion  (See  B  in  Figure  3).  A  combination  of  the  two  strategies 
was  operable  when  the  profile  contained  several  term  sets  having  weights 
less  than  .  999  (See  C  in  Figure  3). 

4.  The  Ixuiical  Negative 

In  combining  the  Boolean  and  weighted  strategies,  the  logi¬ 
cal  negative  was  implemented  by  assigning  a  weight  of  /0  to  the  negative 
term.  If  any  profile  term  having  a  weight  off  0  matched  a  document  term, 
the  document  was  de  -selected.  That  is,  the  document  was  barred  from  se¬ 
lection,  even  though  it  may  have  met  both  Boolean  and  weighted  criteria  for 
selection. 

5.  Non-Subject  Control  Over  Dissemination 

During  the  course  of  the  pilot  test,  it  was  found  desirable 
to  include  some'  controls  over  dissemination  which  were  not  subject  ori¬ 
ented.  One  of  thcHc  which  had  boon  foreseen,  but  not  implemented  lu'cnuse 
of  (he  limited  document  input,  was  security  classification.  In  a  full-scale 
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system  involving  the  processing  o l  classified  information,  it  would  be 
necessary  to  limit  dissemination  according  to  security  classification  of 
document  and  security  clearance  level  of  user. 

An  unanticipated  requirement  for  non- subject  control  over 
dissemination  was  the  participant's  desire  to  be  screened  from  levels  of 
subject  coverage  that  were  below  or  above  his  present  requirements.  In 
the  future,  some  mechanism  should  be  provided  for  describing  a  profile 
interest  in  terms  such  as  superficial,  intermediate,  or  detailed.  Subject 
coverage  alone  is  not  a  sufficient  criterion  for  selection.  A  result  of  not 
having  such  a  provision  in  indexing  both  profiles  and  documents  was  that 
documents  matched  a  user's  interests  by  subject,  but  not  by  depth  of  cov¬ 
erage.  Thus,  someone  who  had  only  a  superficial  interest  in  a  topic  might 
be  supplied  with  information  of  a  highly-detailed  nature  or  vice  versa. 

Other  non-subject  categories  might  play  an  equally  impor¬ 
tant  role  in  selection  or,  conversely,  in  screening.  For  instance,  such 
categories  as  materials  application,  material  identification,  and  proper¬ 
ties  of  materials  might  be  devised  to  screen  out  unwanted  information, 
even  though  it  might  be  selected  on  the  basis  of  a  term-for-term  match. 
This  criteria  was  not  necessary  in  the  pilot  test  because  of  the  limited 
number  of  documents  processed  and  the  relatively  small  number  of  notices 
resulting  from  selection.  In  a  full-scale  system,  however,  the  selection 
process  will  need  to  be  more  discriminating  to  avoid  swamping  the  reader 
with  a  large  number  of  peripherally  interesting  selections. 

Early  in  the  pilot  test,  an  experiment  was  performed  to  de¬ 
termine  the  feasibility  of  indexing  documents  by  other -than -subject  terms. 
In  the  experiment  several  documents  were  indexed  by  contract  numbers, 
authors,  publishers,  corporate  sources,  and  other  non- subject  indicators. 
Each  type  of  indicator  was  given  a  distinctive  glossary  number  similar  to 
those  assigned  to  ordinary  subject  terms.  A  demonstration  proved  that 
indexing  by  non-subject  terms  poses  no  additional  problems  to  the  mechan¬ 
ics  of  matching  and  selection.  Non-subject  terms  in  profile*  could  be  dis¬ 
persed  in  any  set  of  terms  comprising  the  subject  poi  tion  of  the  interest 
profile.  The  fact  that  glossary  numbers  differ  does  not  affect  the  selection 
process,  since  there  is  no  program  requirement  that  glossary  numbers 
assigned  to  terms  in  a  profile  be  identical  either  within  the  set  of  terms 
or  within  the  entire  profile. 

Non-subject  terms  may  have  more  value  when  used  in  the 
negative  sense  than  when  used  in  the  positive  sense.  Several  participants 
spoke  a  foreign  language  fluently  and  many  had  knowledge  of  one  or  more 
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foreign  languages.  However,  some  participants  neither  spoke  nor  read  a 
foreign  language;  foreign-language  abstracts  would  be  of  no  direct  use  to 
them.  Thus  foreign  language  designation  could  be  used  in  these  cases  to 
screen  out  selections  that  would  otherwise  be  made  on  the  basis  of  subject 
alone. 


Several  participants  expressed  a  desire  to  exclude  all  arti¬ 
cles  from  specific  trade  journals,  either  because  of  the  usual  poor  quality 
of  their  articles  or  because  of  the  lack  o l  thorough  coverage  in  them.  Such 
exclusions  would  be  feasible,  as  would  those  based  on  contract  numbers, 
authors,  corporate  sources,  and  publishers. 

Exclusions  could  also  be  based  on  other  factors:  the  par¬ 
ticipant  may  cover  specific  sources  through  his  own  regular  reading;  or  he 
may  already  be  informed  of  certain  activities  through  prior  experience. 

C.  INFORMATION  SELECTION  AND  PREPARATION  FOR 
DISSEMINATION 


Due  to  the  wide  variety  of  interests  exhibited  by  the  participants  in 
the  pilot  program,  and  to  the  large  number  of  subjects  represented  by  the 
document  collection,  xt  was  necessary  to  screen  the  document  input  t«>  as¬ 
sure  a  reasonable  number  of  selections  in  the  pilot  program.  Every  docu¬ 
ment  selected  for  inclusion  in  the  test  sample  underwent  the  processing 
steps  outlined  oelow.  Documents  matching  user  interest  profiles  under¬ 
went  additional  processing  steps. 

I.  Information  Selection  -  Preliminary  Screening 

The  purpose  of  preliminary  document  screening  was  to 
make  possible  the  selection  and  dissemination  of  a  sample  of  documents 
in  the  time  limit  imposed  by  the  contract.  During  the  pilot  program,  1000 
documents  were  to  be  indexed  and  matched  against  the  interest  profiles  for 
twenty -five  Natick  • abor  stories  participants.  Id  the  sample  of  1000  docu¬ 
ments  were  selected  at  random  from  the  total  population  of  information  re¬ 
sources  available  at  .he  NLABS  library,  a  relatively  small  percentage  of 
these  documents  would  result  in  selections  because  of  the  extreme  breadth 
of  interests  represented  within  these  resources. 

The  total  NLABS  current  information  resources  were  sub¬ 
ject  to  both  screening  and  selection  processes.  In  the  screening  process, 
journals  containing  a  low  percentage  of  relevant  information  were  elimi¬ 
nated  from  further  consideration.  These  journals  were  oriented  primarily 
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toward  the  food  technologists,  leather  chemists,  and  other  fields  not  rep¬ 
resented  by  the  interests  of  the  participants.  After  screening,  the  current 
journals  selected  for  further  processing  were  examined  for  specific  arti¬ 
cles  to  be  included  in  the  test  sample.  In  addition,  articles  were  selected 
as  represented  in  the  Technical  Abstract  Bulletin  published  by  Defense 
Documentation  Center  and  Scientific  and  Technical  Aerospace  Reports  pub¬ 
lished  by  the  NASA  Scientific  and  Technical  Information  Center.  Originally 
books  were  scheduled  for  inclusion  in  the  test  sample,  but  they  were  drop- 
f>ed  from  consideration  because  of  the  relatively  small  number  of  acquisi¬ 
tions  received  ir.  the  subject  area  during  the  pilot  test. 

Articles  were  selected  on  the  basis  of  their  potential  appli¬ 
cability  to  the  interests  of  the  participants.  There  was  no  direct  connection 
made  between  an  article  and  a  participant  so  that  there  would  be  no  guar¬ 
antee  that  a  match  would  occur  even  once.  Rather,  an  article  was  selected 
because  it  related,  in  some  way,  to  the  interests  of  the  participant  group. 
The  selection  process  merely  increased  the  probability  that  a  document 
would  be  selected  at  least  once.  If  the  documents  had  been  selected  at  ran¬ 
dom,  perhaps  only  10%  or  less  would  have  matched  any  of  the  participant's 
profiles  so  4hat  90%,  or  900 documents,  would  have  been  indexed  and  pro¬ 
cessed  without  any  selections  taking  place. 

There  is  some  concern  that  the  selection  process  might  have 
biased  the  sample,  so  that  a  higher  percentage  of  selections  would  be  rele¬ 
vant  than  would  have  been  from  a  random  sampling.  To  a  very  small  degree 
this  hypothesis  is  true  because  subject  terms  applieu  to  documents  were 
limited  to  the  subject  content  of  the  samples  selected  for  inclusion  in  the 
pilot  test.  However,  the  relationship  between  match  strategy  discriminu- 
r  i  power  and  the  breadth  of  subject  coverage  is  not  at  present  known, 

'reas  the  undesirable  effect  of  processing  a  large  number  of  documents 
^*eiding  a  relatively  low  number  of  selections  was  quite  predictable. 

2.  Information  Extraction 

After  selection,  each  article  underwent  subject  indexing  and 
descriptive  cataloging.  In  lescriptive  cataloging,  journal  citation  and  arti¬ 
cle  title  were  recorded  in  machine -readable  form. 

Three  different  subject  heading  lists  were  used  in  subject 
indexing.  These  were:  Thesaurus  of  Engineering  Terms,  Thesaurus  of 
ASTIA  Descriptors,  and  Textile  Technology  Terms  -  An  Information  Re¬ 
trieval  Thesaurus.  The  ASTIA  Thesaurus  was  supplemented  by  DDC  Des¬ 
criptor  Frequencies  and  the  ASTIA  Chemical  Thesaurus,  The  various  the  - 
saurl  were  used  to  index  both  documents  and  interest  profiles  m  the  subject 
areas  to  which  they  were  best  suited. 


During  the  project,  four  indexers  were  assigned  to  the  document  indexing 
tasks.  This  procedure  precluded  the  possibility  that  the  indexer  was  sub¬ 
consciously  steering  an  abstract  to  a  participant,  since  three  of  the  in¬ 
dexers  were  totally  unfamiliar  with  *he  interest  profiles.  Assignment 
ranged  from  three  to  thirty  terms  per  document  with  an  average  of  eight. 
All  terms  were  put  in  machine -readable  form. 

3.  Information  Preparation  tor  Dissemination 

The  standard  processing  for  each  article  consisted  of  the 
following  steps:  (1)  select  journal  article,  and  mark  journal  table  of  con¬ 
tents,  (2)  Xerox  journal  table  of  contents  and  title  page  of  each  article 
marked,  (3)  assign  accession  number  and  stamp  on  table  of  contents  (See 
Figure  4),  title  page,  descriptive  cataloging  keypunch  form  (See  Figure 
5),  and  subject  indexing  keypunch  form  (See  Figure  6),  (4)  perform  sub¬ 
ject  indexing  and  descriptive  cataloging,  (5)  cut  article  title  page  for  repro¬ 
duction  (See  Figure  7),  and  (6)  keypunch  title,  journal  citation  (See  Figure 
8)  and  term  cards  (See  Figure  9). 

In  preparation  for  the  first  SDI  cycle,  title  pages  were  not 
cut  for  reproduction.  As  a  result  of  operating  the  first  cycle  it  was  dis¬ 
covered  that  approximately  50%  of  the  documents  processed  were  selected 
at  least  once.  As  a  result  of  this  discovery,  a  decision  was  made  to  cut 
all  title  pages  prior  to  exercising  the  match  function,  so  that  articles  would 
be  available  for  immediate  reproduction 

D.  VOCABULA liY  CONTROL 


As  a  result  of  an  analysis  performed  in  Phase  l,  a  decision  was 
made  to  utilize  a  controlled  vocabulary  as  opposed  to  a  vocabulary  of  terms 
selected  from  free  text.  This  decision  was  based  on  the  fact  that  results 
were  generally  better  for  those  systems  using  a  controlled  vocabulary. 

1.  The  Multiple  Glossary  Concept 

Since  a  technical  thesaurus,  in  the  areas  of  textiles,  plastics, 
high-temperature  properties  of  materials,  etc.  ,  was  not  available  in  a  sin¬ 
gle  source  document  at  the  commencement  of  the  pilot  test.,  it  was  neces¬ 
sary  to  utilize  several  of  the  existing  glossaries,  thesauri,  subject  head¬ 
ings  lists,  and  other  authorities  currently  available  from  individual  sources. 
On  the  basis  of  a  preliminary  analysis  of  the  interests  of  the  participants, 
several  existing  authority  lists  were  found  to  be  applicable.  These  were: 
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Influence  of  Oblique  Channel  Ends  on 
Screw-Pump  Performance 


Flow  of  very  visions  Im|iii<Is  tin  out'll  screw  pumps  .mil 
metering  /ones  of  extruders  li.is  !x*cn  investigated  l>\ 
various  authors,  a  few  of  whom  .or  listed  in  the 
1‘ini's  (1-fi).  Screw-pump  equations.  which  relate  Him 
late  lo  sjHi‘il,  viscosity,  .mil  discharge  prcssmc.  arc  de 
r  ed  ill  (In -si-  pa|X*rs  lot  a  iiumlx-i  o!  simplifying  as 
siuupt  ions. 

The  liolii-al  screw  channel  is  o'plai-i'il  l>\  a  straight 
prismatic  channel  with  a  icelangiilai  cross  sod  ion  in  alt 
hut  one*  llioorv  (5.  (i).  In  (ho  simplified  (lioors.  one  as¬ 
sumes  such  a  lari'!*  aspect  ratio,  i.c..  ratio  of  ch.muol 

width  to  depth,  that  flow  rates  can  ,***  1  1  1 . . 

"■U-known  velocity  ilisliil" 


and  the  harrol.  Note  that  the  channel  has  ohlirpie  ends, 
which  are  ignored  in  the  simplified  theorv.  lsotlwrinal 
Him  of  a  very  viscous  Newtonian  liipiid  is  assumed. 

The  How  auahsis  is  simplified  when  velocities  arc 
iloscrilx'd  relative  to  the  screw.  Those  relative  velocities 
do  not  change  when  either  the  barrel  is  at  a  standstill 
and  the  screw  rotates,  or  the  screw  is  at  standstill  and 
the  barrel  rotates  with  an  eiptal  speed  in  the  opposite 
direction,  provided  inertial  and  lx>d\  forces  can  lx*  ig 
nored.  Thus  the  theorv  is  restricted  to  low  llcynolds 
no mhers,  which  requirement  is  satisfied  in  most  screw 
pumps  since  dies  are  oxch*,;  '  '  *niinp  very 

» iscous  liquids  at  l<"  the 


...iputied  tlieory  (2,  3,  4)  that 
are  ttxeti  me  derivation  of  end  effects  are  discussed 
first.  A  schematic  diagram  of  a  screw  pump  is  slum’ll  in 
Fiflurr  /.  The*  more  important  nomenclature  is  shown  in 
the  cross  section  of  Figure  2.  Tlu*  scresv-pnmp  channel 
is  forme. i  !x*lween  tin*  flights,  the  screw  root  surface, 

kjlymir  [NGiNmiNG  and  scicnci,  January,  iw 
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Modern  Textiles  47/9  Sep  66 

DURABLE  PRESS 

IfEse  Kole  of  Polyesters 


by  Wells  Denyes 
Senior  Textile  Engineer 
Textile  Development  Division 
Tennessee  Eastman  Co 


W  itiiout  QUKSTION,  the  most 
exciting  m  w  phrase  in  the  entire 
textile  industry  right  now  is 
“Durable  Press."  From  the  re¬ 
search  chemist  to  the  housewife, 
textile  products  today  cannot  be 
discussed  without  mentioning  the 
new  term.  Researchers  are  already 
.seeking  new  resins,  new  fibers. 


new  fabrics,  and  new  applications 
for  dur  ble  press.  Housewives  are 
busy  res.  eking  the  family  ward¬ 
robes  and  u  mshings  to  take  full 
advantage  of  the  many  wonderful 
properties  of  durable  press. 


Figure  7  Cut  Title  Page 


28 


Figure 


8  Title 


and  JoffinaLSiSai!^^--'— ' “ 


29 


_ JS  00  0.^1  lH,j 

Ob 

_  H'tVi.-':  •  L*N I C  ftST  V£h I CLES 

* 

— > 

J2QQ021 190 

J3 00021 190 

.03 

03 

HYDROGEN 

OXYGEN  "  - 

....  . .  - - — 

—A 

_J200021190 

_08 _ 

FLAMES 

00 02 1190 

0:3 

_SI  AJNLESS.JSTEEL ‘ 

N 

._J£A0jJ2jii.90 _ 

_03 _ 

_THEMAL  PROPERTIES  . 

s 

J200021 190 

OS 

BONDING 

\ 

J2Q0021 190 

03 

LAMINATES 

J200021 190 

08 

'  METALS 

_ J200021190 _ 

liTrt  rV*  s  m  .=* 

JB _ 1 

ZIRCONIUM  COMPOUNDS 

n*J  t  r*rr 

- — _ 

_ 

J2 00 02 11 90 


OS 


ALUM I HUM  COMPOUNDS 


J200021i90'  OS  HEAT-RES istant  materials 


* Jg^yi£::i^4 M. 

:  JeMOdTi^o  os 


COATINGS 


0011,0000:0000  0000000000000000  000  OOOQOOOOOOOOOOOOOOOOQOQQOOOOOOOOQOOOOOOOOOOO 

I  I  )  •  i  I  I  •  •  It  ■  0  UM  »»  II  I*  Hit  »»)))*  »!*»  Hint  »  HUM  »» II  a»«4lO  »•«<>«•«>  •*<!»]>  UWMUWIIMtlHUUUKUHIIUtllili  111]  14  HI*  II  mill 

1 11 1 1  11 1 II II 11 1  ill  1 1  I II 1 1 1  1 1 II 1 1 1 11 1 1 1 1 1 1 1 1  111  1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

2222.2222222222222222222222222222222222222222222222222222222222222222222222222 
13133111113333333333  31333  3  33  333333333331333333333333331331133333333333333333  | 

044444444444444444444444  44444444444444444444444««4444444444444444 4444444444444*.  S 
55555  55  5  5555555,553  55.ST  5 5 55 55 55555  55555555555515-5155  5  55S  5  5  5  5  5  SSS55  55  S55'5SS5S  5  5  5  5 
*(((!(( (OSO tt (((((((((( 4(60(1  (i((  ((((((((  0  0  0  (  4  (0  6 Db 6 StSttt tit  0  (  0  0  4 0  0  4  0  0  4  0  0  4  0  (  0  0 

rn'iiiiiimninnmmnnmi',  imninimmimmmmmmnmnm 

1(011111(411111  l(  II  (((((( !(!(((  1(1(1(0  0(1  S((l(0(ll((l((l((t(ll((lim((  41(3(1(3 
iimm  iimmsms  99  999999  99959999999999999999999999999995999999999999993 

1  t  )  *  s  t  1  •  %  *  h  u  u  n  n  «  »  «  *  m  it  un  >«  n  h  v  hit  a  j«  w  u  x  »  *  v  m  ji  m  ••  «i  4j  m  os  4  41  «mim  v  m  ut<  »  »  v  u  mu  t>  u  *1  u  »5  t*  w  u  11  «  m  »  »j  »  »  u  u  >t  a  u 


Figure  9  Document  Term  Punched  Cards 


30 


Textile  TV  oology  Terms:  An  Information 
Retrieval  *esaurus.  first  edition,  second 
printing,  February  1966,  Textile  Information 
Center,  Institute  of  Textile  Technology, 
Charlottesville,  Virginia 

Thesaurus  of  Engineering  Terms,  first 


edition.  May  1964,  Engineers  Joint  Council, 

New  York,  New  York 

Thesaurus  of  ASTIA  Descriptors,  second 
edition,  December  1962,  Armed  Services 
Technical  Information  Agency,  Arlington, 

Virginia. 

In  the  subject  area  of  textile  technology,  a  thesaurus  com¬ 
parable  to  that  published  by  ITT  was  located  at  M.  I.  T.  This  was  the 
Thesaurus  of  Textile  Engineering  Terms,  first  edition,  February  1966, 
by  Stanley  Backer,  et  al,  M.  I.  T.  Department  of  Mechanical  Engineering, 
Fibres  and  Polymers  Division.  The  Thesaurus  of  Astia  Descriptors  was 
supplemented  by  the  Astia  Chemical  Thesaurus,  first  edition,  December 
1962,  the  DDC  Supplement  to  the  Thesaurus  of  Astia  Descriptors,  second 
edition,  April  1964,  the  DDC  Thesaurus  Code  Manual,  second  edition, 
August  1963,  and  the  DDC  Descriptor  Frequencies.  May  1966.  In  addition, 
terms  were  also  used  as  they  appeared  in  the  DDC  Technical  Abstract 
Bulletin  appended  to  abstracts  selected  for  dissemination.  A  publication 
received  too  late  for  use  in  the  pilot  program  was  Glossary  of  Plastics 
Terms:  A  Consensus.  December  1966,  Plastec,  Plastics  Technical  Eval¬ 
uation  Center,  Dover,  New  Jersey.  This  is  an  excellent  glossary  of  the 
commonly  used  technical  terms  in  the  fields  of  plastics,  adhesives,  and 
elastomers. 

As  could  be  expected,  there  was  a  considerable  overlap 
among  the  terms  uded  in  the  three  thesauri,  the  greatest  between  the 
Thesaurus  of  Astia  Descriptors  and  the  Thesaurus  of  Engineering  Terms. 
The  former  was  most  distinctive  in  the  areas  of  military  applications, 
whereas  the  latter  was  most  distinctive  in  the  areas  of  plastics,  polymers 
and  elastomers.  Each  document  (journal  article)  was  indexed  primarily 
from  only  one  of  the  three  thesauri.  The  choice  of  thesaurus  was  guided 
by  the  following  table : 

used  for 

plastics,  physical  properties  of  materials, 
hitfi-vacuum  techniques  and  equipment. 


I  «•  ' 


physical  measurements,  analytical  instrumentation 
and  refrigeration. 

(2)  Thesaurus  of  Astia  Descriptors  for  chemical,  bio¬ 
logical,  and  radiological  warfare,  chemistry  (or¬ 
ganic  and  inorganic),  protective  clothing,  and  space 
technology. 

(3)  Textile  Technology  Terms  used  for  synthetic  fibers, 
textiles,  dyeing,  detergents,  fabric  properties,  fiber 
properties,  and  clothing  manufacture. 


There  were  several  difficulties  encountered  in  utilizing  three 
thesauri  for  indexing  both  documents  and  interest  profiles.  The  most  com¬ 
mon  was  that  a  document  could  logically  be  indexed  by  more  than  one  the¬ 
saurus  so  that  personal  preference  of  the  indexer  played  some  part  in  de¬ 
termining  which  thesaurus  would  be  used.  This  caused  a  loss  in  potential 
selections  due  to  the  fact  that  a  particular  interest  profile  may  not  have 
contained  some  of  the  terms  from  the  thesaurus  used  to  index  a  document 
relevant  for  that  individual.  A  second  difficulty  was  the  lack  of  a  valid 
term  or  terms  to  describe  a  particular  document  or  interest.  This  situa¬ 
tion  most  frequently  related  to  articles  dealing  with  research  in  organic 
chemistry.  Understandably,  many  of  the  specific  chemical  names  were 
absent  from  all  three  thesauri. 

To  prevent  the  loss  of  valid  selections,  each  profile  was 
supplemented  by  terms  from  all  three  thesauri.  This  caused  an  overex¬ 
pansion  of  the  user  profile,  but  yielded  a  higher  recall  ratio.  A  much 
needed  exercise,  but  one  outside  the  scope  of  this  contract,  is  that  of  con¬ 
catenating  all  of  the  thesauri  pertinent  to  NLABS  endeavors.  This  project 
would  involve  the  keypunching  of  approximately  50, 000  terms  and  manual 
sifting  to  eliminate  duplicate  terms  and  synonyms,  and  would  have  to  be 
undertaken  prior  to  implementation  of  the  full-scale  SOI  system.  Although 
the  use  of  multiple  glossaries  is  feasible  during  a  pilot  test,  involving  a 
limited  document  sample  and  population  of  users,  it  would  be  unworkable 
in  full-scale  implementation. 

2.  Glossary  Organization  and  Coding 

Each  glossary  was  assigned  a  unique  two-digit  number.  The 
glossary  number  combined  with  the  alphabetic  spelling  of  the  term  consti¬ 
tuted  the  unit  for  comparison  between  document  terms  and  interest  profile 
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terms.  Even  though  a  term  may  have  been  spelled  identically  in  two  dif¬ 
ferent  glossaries,  a  match  between  a  profile  term  and  document  term  would 
not  occur  unless  the  glossary  numbers  were  identical.  This  rigid  associa¬ 
tion  of  term  and  glossary  was  maintained  in  order  to  trace  the  effect  of  the 
choice  of  glossary  on  the  selection  process.  In  addition  to  a  glossary  num¬ 
ber,  each  term  could  be  assigned  a  unique  four- digit  number  to  identify 
it  within  the  glossary.  The  term  number  combined  with  the  glossary  num¬ 
ber  could  be  used  in  the  matching  process  in  lieu  of  matching  the  alpha¬ 
betic  term  itself. 

Numbers  could  be  assigned  so  that  a  numeric  sort  could 
accomplish  an  alphabetic  sorting  of  the  terms  within  a  specific  thesaurus. 
This  practice  would  ultimately  facilitate  the  conversion  of  the  match  and 
selection  process  to  an  inverted  file  so  that  matching  time  by  computer 
could  be  substantially  reduced  (See  Figure  10). 

E.  USER  INTEREST  PROFILE  CONSTRUCTION 

Construction  of  the  user  profile  is  the  most  critical  and  the  most 
difficult  aspect  of  selective  dissemination  of  information.  The  task  of 
creating  ini  ere  at  profiles  can  be  much  simplified  by  a  coherently  orga¬ 
nized  source  of  technical  terms.  The  fact  that  a  single  coherent  indexing 
tool  was  not  available  during  the  conduct  of  the  pilot  test  understandably 
overcomplicated  the  task  of  creating  interest  profiles.  However,  the  tech¬ 
niques  used  to  circumvent  this  difficulty  allowed  for  the  creation  of  valid 
profiles  and  for  the  testing  of  several  different  profiling  techniques. 

As  pointed  out  in  Section  IV,  B,  computer  programs  wen*  written 
to  implement  the  concurrent  use  of  two  different  match  strategies.  Both 
strategies  exemplify  a  multidimensional  retrieval  approach.  Dimensions 
consisted  of  term  sets  exemplifying  different  |X>ints  of  view  or  different 
aspects  of  the  same  interest.  For  instance,  an  interest  may  be  expressed 
as  'diffusion  of  oxygen,  nitrogen,  and  carbon  dioxide  through  polyethylene 
and  other  polymer  films.  "  The  different  aspects  of  this  interest  are:  (1) 
the  process  (diffusion);  (2)  the  (ihysical  state  (gas)  and  chemical  composi¬ 
tion  (oxygen,  etc. )  of  the  diffusing  substance;  (3)  the  com|x>sition  of  the 
barrier  (oolyethylene);  and  (4)  the  physical  form  of  the  barrier  (fibn).  Any 
one  dimension  might  prove  to  be  the  most  limiting  factor  in  retrieval.  That 
is,  if  the  document  collection  contains  a  minimal  number  of  articles  on 
gasses,  then  this  facet  of  the  interest  profile  would  bo  the  most  limiting. 

The  ability  to  take  advantage  of  the  selectivity  of  one  dimension  of  .»  multi¬ 
dimensional  interest  enhances  both  relevance  and  recall.  It  was  for  this 
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Figure  10  Glossary 


reason  that  the  Boolean  and  weighted-term  strategies  were  selected  for 
testing  in  this  pilot  program. 

1.  Structure  of  an  Interest  Profile 


Each  interest  profile  consisted  01  one  or  more  sets  of  terms 
descrijjtive  of  each  aspect  of  a  multidimensional  interest.  In  testing  the 
weighted-term  strategy,  all  terms  in  an  interest  profile  were  carried  in 
the  same  set  so  that  essentially  the  profile  was  single  dimensional.  Di¬ 
mensions  were  simulated,  however,  by  giving  certain  terms  more  weight 
than  others.  In  testing  the  Boolean  strategy,  all  terms  were  given  the 
highest  possible  weight,  but  like  teims  were  grouped  together.  The  match 
strategy  required  that  at  least  one  term  from  each  group  match  a  term  in 
♦be  document  description.  The  two  methods  were  combined  for  some  pro¬ 
files,  so  that  selection  was  based  on  both  term  grouping  and  term  weights. 

In  the  first  dissemination  cycle,  ail  profiles  were  constructed 
according  to  the  weight  strategy  with  every  term  carrying  the  highest  pos¬ 
sible  weight.  Thus,  a  selection  would  be  assured  if  any  one  profile  term 
matched  a  document  description  term.  In  the  next  dissemination  cycle, 
some  of  the  profiles  contained  multiple  sot-,  of  terms  and  some  contained 
only  one  set  of  terms.  All  weighs  were  set  at  .  5  so  that  selection  would 
occur  only  if  two  or  more  profile  terms  matched  document  terms.  Pro¬ 
files  were  modified  from  the  first  cycle  by  addition  of  terms  included  in 
documents  selected  in  cycle  I.  Terms  were  also  added  from  documents 
processed  in  cycle  I.  but  not  automatically  selected.  (Participants  were 
allowed  to  scan  a  list  of  the  titles  of  documents  processed  in  cycle  I  and 
to  select  the  titles  that  appeared  to  be  interesting  wlucn  were  not  selected 
automatically  for  them. )  For  cycle  HI,  profiles  were  extensively  reworked 
to  eliminate  terms  which  were  suspected  of  causing  irrelevant  selections 
and  by  combining  terms  from  different  glossaries  into  the  same  profile. 

The  Utter  move  was  made  to  reduce  processing  time  and  had  no  effect  on 
selection.  In  cycle  III,  the  profile  structure  remained  basically  the  same, 
except  that  some  profiles  were  c.  mged  from  the  Boolean  to  the  weighted 
strategy.  In  cycle  IV,  profiles  were  again  reworked  on  the  basis  of  re¬ 
sponse  to  selections  from  cycle  lit.  In  thU  cycle,  weights  were  varied  to 
determine  the  effect  on  selection. 

2.  Preliminary  Screening  of  Terminology 

Derivation  of  the  initial  interest  profile  was  the  most  dif¬ 
ficult  phase  of  profiling.  Because  of  the  use  of  several  thesauri  and  a 
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limited  number  of  copies  of  each,  a  decision  was  made  to  present  each  par¬ 
ticipant  with  a  subset  of  terms  which  were  thought  to  reflect  his  interests 
based  on  his  own  statement.  These  sets  of  terms  were  assembled  into  a 
folder  for  each  participant.  At  a  group  meeting,  participants  were  asked 
to  select  the  terms  from  the  list  in  their  folder  and  to  record  them  on  a 
preprinted  profiling  form.  The  participants  were  also  allowed  tc  supplement 
the  terms  available  to  them  by  scanning  the  three  thesauri.  This  process 
was  necessary  for  more  than  half  of  the  participants,  since  the  terms  pro¬ 
vided  were  not  sufficient.  However,  all  participants  expressed  a  favorable 
opinion  of  being  provided  with  an  abbreviated  version  of  the  thesauri  that 
was  tailored  to  their  expressed  interests.  Even  though  more  than  half  of 
the  participants  selected  additional  terms  direct  from  the  thesauri,  most 
of  the  terms  which  they  recorded  weie  obtained  from  the  preprinted  lists. 

Profiling  was  accomplished  in  three,  two-hour  profiling  ses¬ 
sions.  Members  in  the  first  group  were  allowed  to  structure  their  profiles 
according  to  the  Boolean  strategy.  Members  in  the  second  group  used  a 
weighted  strategy.  Third-group  participants  were  allowed  to  use  a  combi¬ 
nation  of  the  two  methods.  For  the  initial  SOI  cycles,  however,  these  struc¬ 
tures  were  temporarily  suspended  until  further  experience  could  be  obtained 
on  the  percentage  of  matches  that  would  occur  with  the  document  sample. 

3.  Preparation  of  Interest  Profiles 

Initial  interest  profiles  were  keypunched  from  worksheets 
which  contained  four  preprinted  columns  (Sec  Figure  11).  Columns  were 
designated  for  profile  number,  term  set  number,  assigned  weight,  and  term 
with  glossary  number.  Initially,  profiles  were  punched  with  varying  weights 
and  levels  as  written,  but  after  the  decision  had  been  made  to  perform  a 
test  cycle  using  single -aspect  profiles  and  maximum  assigned  weignts,  the 
punched  cards  were  duplicated  and  a  standard  set  number  equal  to  one  weight 
of  .  999  were  gang  punched  into  the  entire  profile  deck.  In  subsequent  runs, 
levels  were  restored  to  the  profiles  and  weights  were  varied  from  the  min¬ 
imum  of  .  999.  Profiles  in  cycle  II  were  expanded  over  200%  from  those 
used  in  cycle  I.  In  cycle  III,  profiles  were  reduced  from  cycle  II  by  30%. 
Cycle  IV  profiles  required  very  few  changes  over  cycle  III  profiles.  The 
substantial  additions  made  over  cycle  I  profiles  necessitated  keypunching 
a  significant  number  of  additional  cards.  Changes  on  cycle  II  profiles, 
however,  required  mainly  the  elimination  of  cards.  Cycle  IV  changes  re¬ 
quired  the  addition  of  a  relatively  small  number  of  terms. 

F.  DISSEMINATION  OF  INFORMATION 

SDI  notices  were  prepared  by  conventional  office  duplicating  tech¬ 
niques  and  collated  for  distribution  on  EAM  card  processing  equipment. 
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|  Availability 

1  tor  Info  Only 

2  Will  Obtain 

3  For  Ref  Only 

4  K«>i  Circulation 

5  i-'t  rsonal  Coi"' 

*  Profiless01  thru  99 


2  Notice  Format 

1  Title 

2  Terms 

i  Abstract 


igure  li  Profile  Worksheet 
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3  clo3sary 


01  Originating  Activity 
02  Sponsor 
03  Contract  No. 

04  Publication  Source 
05  Personal  Author 
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Preparation  of  SDI  Notices 


The  title  page  of  each  journal  article  was  copied  onto  8  1/2 
x  14"  paper  on  a  Xerox  #914  Office  Copier  (See  Figure  5)  which  allows 
bound  volumes  to  be  copied  without  destroying  the  binding.  The  journal 
page  was  positioned  to  allow  a  free  space  at  the  bottom  of  the  copy  approx¬ 
imately  three  inches  high.  A  preprinted  keypunch  form  and  a  typed  strip 
of  paper  designating  the  journal  title,  volume,  number,  and  date  were 
placed  in  this  area.  The  keypunched  form  designated  three  card  types 
and  the  information  to  be  keypunched  in  each  card  field.  The  complete 
Xerox  copy  thus  provided  all  the  descriptive  cataloging  information,  in¬ 
cluding  the  title,  journal  citation,  authors,  author  affiliation,  contract  sup¬ 
port,  and  translation  information.  During  the  pilot  test,  only  title  and  jour¬ 
nal  source  were  recorded  in  machine -readable  form.  While  each  title  page 
was  being  keypunched,  subject  indexing  was  performed  on  the  original  jour¬ 
nal  article,  and  subject  terms  were  recorded  on  a  second  preprinted  key¬ 
punch  form  (See  Figure  6). 

On  return  of  the  document  title  jxige  copy,  punched  cards 
were  proofread  against  the  title  page  and  corrected.  At  the  conclusion  of 
the  correction  process,  the  information  to  appear  on  an  SDI  notice  was 
clipped  from  the  title  page  and  stored  in  a  numbered  envelope  (See  Figure 
7).  This  information  included  the  title  and  other  descriptive  cataloging 
information  and  the  abstract,  introduction,  or  some  portion  of  the  article 
itself.  These  clippings  were  stored  in  numbered  envelopes  filed  in  numer¬ 
ical  sequence  according  to  an  assigned  accession  number  stamped  on  the 
article  title  page  at  the  time  of  selection. 

Terms  keypunched  from  the  document  and  subscriber  pro¬ 
file  worksheets  are  input  to  the  computerized  match  function.  The  com¬ 
puter  program  matches  document  terms  on  one  magnetic  tape  against 
profile  terms  on  another. 

The  output  of  the  computer  matching  process  is  a  punched 
card  deck  containing  two  types  of  cards.  One  type  contains  the  document 
number  and  the  number  of  times  that  document  was  selected  for  dissemina¬ 
tion  (See  Figure  12).  The  other  card  contains  a  document  number  and  a 
subscriber  and  profile  number  (See  Figure  13).  After  the  two  card  types 
are  separated  by  sorting,  one  card  deck  is  sorted  into  numerical  order  by 
the  number  of  copies  of  each  document  required.  The  sorted  deck  is  listed 
to  provide  directions  for  the  notice  duplication  process  (See  Figure  14). 
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Figure  13  Distribution  Card 
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Figure  14  Copy  Card  Listing 
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During  pilot  ope  ration,  the  maximum  number  of  copies  re¬ 
quired  of  any  one  Sl)l  notice  was  fourteen  out  of  a  possible  twenty  live. 

Since  the  mean  number  of  copies  required  was  three,  reproduction  by  print¬ 
ing  was  uneconomical,  since  press  set-up  time  and  the  master  preparation 
would  cost  more  than  conventional  office  copying.  For  this  reason,  all  SDI 
notice  copies  were  prepared  on  the  Xerox  914  using  prescored  four -up 
IBM  card  stock  (See  Figure  15).  This  stock  is  a  recently  developed  item 
available  from  Essex  Systems  Incorporated  in  quantities  of  1000  sheets  (or 
4000  cards)  per  case. 

The  printed  list  containing  the  number  of  copies  required  of 
each  document  (See  Figure  14)  is  used  as  a  basis  for  assembling  the  clipped 
sections  of  journal  article  title  pages  into  master  sheets  requiring  the  same 
number  of  duplicate  copies.  First,  all  the  clippings  from  which  one  copy  is 
to  be  made  are  pulled  from  the  numerically  stored  envelopes;  then,  all  clip¬ 
pings  requiring  two  copies  are  pulled,  etc.  The  content  of  each  envelope  is 
fixed  to  a  format  sheet  blocked  out  to  correspond  with  the  four  IBM  card- 
sized  image  areas  of  the  Essex  card  stock  (See  Figure  16).  All  clippings 
requiring  one  copy  are  affixed  to  as  many  sheets  as  necessary.  The  same 
process  is  required  for  all  of  those  clippings  requiring  two  copies,  three 
copies,  etc. 

Each  format  sheet  containing  up  to  four  card  images  is  placed 
face  down  on  the  Xerox  914  copy  glass,  and  the  Essex  card  stock  is  loaded 
into  the  paper  stock  feeder.  In  some  instances,  I  wo  sides  of  a  card  are  re¬ 
quired  to  carry  the  information  to  be  disseminated.  When  this  is  necessary, 
the  flip-side  image  is  properly  positioned  on  the  reverse  side  of  the  format 
sheet.  After  duplication  of  the  front  side,  the  card  stock  is  turned  over, 
reloaded  into  the  feeder,  and  the  flip- side  image  is  copied. 

At  the  conclusion  of  the  duplicating  process,  the  card  stock 
is  bursted  (See  Figures  17  and  18)  and  the  required  notice  copies  are  assem¬ 
bled  behind  the  punched  and  interpreted  copy  card  generated  by  the  computer 
matching.  The  assembled  card  deck  thus  contains  a  header  card  punched 
and  interpreted  for  document  number  and  number  of  copies,  followed  by  the 
appropriate  number  of  imprinted  unpunched  notice  cards.  This  card  deck 
is  then  intersperse  gangpunched.  Punching  transfers  the  document  number 
in  the  header  card  into  each  one  of  the  notice  cards  behind  it.  A  first  sort 
removes  the  header  card,  the  remaining  card  deck  is  then  sorted  in  numer¬ 
ical  order  by  document  number  (See  Figure  19). 
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Adsorption  of  a  Nonionic  Surfoctant  by  Cotton  II.Ws  scnoTT 

AlIKIIiMT  tin-  il<lw>l  1*1  mil  .'Hill  dcsol |>ll..ll  Ilf  |>il\ i>\\ il  li\  l;(l<ll  t  . . 1  <*■>■«  I  :t  I II  i  ll^  .11. 

Ilf  II  ctliyl«*in'  ••Mill*  (Kl)j  iiuiis  mi  •*iiii.*n  win*  f«t mi ii*tl  IIVIT  .’I  i**UH'Ciilruti<m 
laiicr*  «*f  l«n  •lr«-:nli*s  in  Wist il !**•  1  water  at  'J.*i  (' 

Tin*  iih'IIhm)  •■••usistisl  in  c<pitlil>r:it ini;  ••••mm  falnie  nr  cli<ip|M*d  lilx*r  with  sm 
l.ielanl  solutions  ami  •  Icleriiiittiiii;  l In*  •■•|Mil iI»i ••mi  surfactant  I'oiicctilnliou  by 
fait*  lensiiiu  or  l»y  tin*  coli.dl  liluc  inl.iriiiii  i in*  analysis.  Siirfaci*  tension  is  nmn* 
iilfeclcl  l»y  sinlactant  molecules  of  few  Kit  mills,  whereas  the  eoloriinet I'ic  luetlioii 
cmplittsi/CH  the  »non*  highly  |s>lyo\yeilul.iie<l  surfaetalit  nioleeiiles;  the  ohservcii 
•hsi'rrpaney  lietwi*cii  tin*  two  methods  iltr  ,  '.ur ,  i  was  ra use* I  hy  |in*fer«*ntial  inlsorp 
lion  of  surfaelaiil  of  sliorlcr  hit  t  chain  Iciiulh  onto  latiioii. 

Tin*  ailsorpl hin  was  almost  completely  reveisihle.  Tin*  ailsorption  isotherm  levclcil 
oil  at  :io  lilt!.  surfaclatit/IUI  g.  cotton  near  the  critical  micelle  eoiieentratioti.  In  view 
•»f  the  s|n*eilic  surface  area  of  rollon,  this  level  of  ailsorltcd  surfactant  is  of  the  onler 
••I  tin*  amount  leipiinsl  to  rover  the  eollou  completely  with  a  close  packeil  monolayer 
•  •I  surfaelaiil  molecules  It  inn  Mat  .i|!ainst  the  sul<slrate 
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Surface  Dye  Sensitization.  II.  Sulfur  m  \  (  j  «S-1 ' » 

S.  \V.  1X0,  J».,  AND  V'.  S.  CtltAXO 
Krstanh  hthuralories,  Xerox  Corporal  ion,  Webster,  New  York 
(Received  28  July  1966) 

I.  INTRODUCTION 

SULFUR,  being  a  widc-bandgap  material,  has  very 
little  photoelectric  response  to  visible  light.  Spear 
and  Adams'  have  studied  the  electronic  structures  in 
orthorhombic  sulfur  crystals.  They  conclude,  tenta* 
lively,  that  there  is  a  v. deuce  hand  for  hole  conduction; 
however,  the  electrons  move  about  hy  a  hopping 
proa's-.  The  intrinsic  bundgap  energy  for  holc-clectron 
generation  is  reported*  to  lie  in  the  range  from  3.65  to 
4.30  eV. 


We  have  observed  that  certain  dyes  applied  to  the 
surface  of  the  crystalline  S  t  orthorhombic)  can  extend 
its  photo,  id  t lie  response  to  the  visible  region.  Sulfur 
is  the  l.irgist-lundgap  inorganic  material  we  have  seen 
which  exhibits  this  dye  sensitized  response 


Figure  16  Paste-up  of  Card 
(Four- up) 
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Figure  17  Bursting  of  Curdstock 
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fWtxi'U  m  i his  studs  In  tutlci  lo  stimulate  the 
•  es|Mi.ii»m  to  the  Ml  hu*is/mimiie  volume  levels 
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foot  has* line  \leteimtn.iiuMi  was  repealed  at  14.000 
It  el  eseisistnp  at  iKc-  umr  wotk  kiad  level  as  useJ 
in  the  htgh-alitiudc  tests  fhc  mask  maintained 
all  siihprsls  in  »  satislachuy  physnikiftenl  comly 
Imwi  al  all  altitudes  tip  to  and  including  40.1100  feet 
f< w  ihe  dutaiMtn  of  exposure  used  in  these  texts. 

<  AuthtM  | 
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()(|M  An  Infrared  Spectroscopic  Investigation  of 

Photodegraded  Styrene  Copolymer  Films 

# 


Raymond  B.  Stymour,  Hing-Shya  E.  Tsang,  Ooug  Warren,*  Chemistry  Department,  University  of  Houston 


The  degradation  of  oigumc  polymers  in  the  prcMiicc 
of  ultraviolet  light  is  Irtl,  ik  ally  and  economically  im¬ 
portant.  Considerable  emphasis  lias  lieen  pined  on  the 
development  of  lest  procedures  and  the  inw-stig.itiou  ol 
the  .nechoisism  of  phot  odegradat  ion 


Committee  l)-20  of  ||m>  American  Society  for  Testing 
Materials  has  esiahlished  a  sulicoinmitlrr  to  evaluate  the 
effect  nf  light  on  pcitnanence  pr<*pcrties  of  plastics.  Itec- 
otninended  practices  have  'neen  outlined  for  outdoor  (I) 
and  accelerated  weathering  tests  (2).  In  one  tent  pro¬ 
cedure,  the  inlen-.itv  of  n.ii  1  sunlight  is  increased  hy 
tire  use  of  miirOis  ( V, 


Figure  18  Single  Card  Notice 
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2.  Collation  and  Delivery  of  Notices 

The  distribution  cards  punched  with  a  document  number  and 
subscriber  and  profile  numbers  are  also  sorted  into  numerical  order  by 
document  number  (See  Figure  20).  These  cards  are  duplicated  into  a  pre¬ 
printed  response  card  (See  Figure  21).  The  response  cards  are  then  col¬ 
lated  with  the  notice  cards  so  that  response  cards  alternate  with  notice 
cards  in  the  resulting  merged  deck.  This  deck  is  then  intersperse  gang- 
punched  so  that  the  subscriber  and  profile  numbers  are  transferred  to  the 
notice  card  behind  each  response  card.  Punching  allows  the  deck  to  be 
sorted  into  numerical  order  by  profile  number  so  that  all  notices  and  re¬ 
sponse  cards  to  be  sent  to  one  participant  fall  in  the  same  sorting  pocket. 

During  the  pilot  operation,  packets  of  notice  and  response 
cards  were  addressed  by  hand.  In  full-scale  operation,  this  function  can 
be  accomplished  by  merging,  by  means  of  EAM  equipment,  an  address  head¬ 
er  card,  prepunched  and  interpreted  and  containing  a  distinctive  colored 
stripe  to  act  as  a  separator  between  the  packets  of  notice  and  response  cards. 

To  expedite  the  evaluation  of  the  notice  cards  and  return  of 
the  response  cards,  each  packet  of  response  and  notice  cards  was  hand- 
delivered  to  each  participant  in  the  pilot  program. 

G.  RESPONSE  FROM  PARTICIPANTS 


Participant  responses  were  obtained  oraJh  at  the  time  of  notice  de¬ 
livery  and  by  means  of  the  preprinted  response  car  d  associated  with  each 
notice  delivered. 

1.  The  Response  Form 

The  response  form  (See  Figure  21)  was  divided  into  four 
quarters  of  approximately  equal  size.  The  first  quarter  contained  document 
number,  subscriber  number,  and  profile  number  interpreted  at  the  upper 
left  hand  edge  of  the  card.  The  second  quarter  contained  marking  positions 
for  six  comments:  of  interest,  notify  when  available;  of  interest,  document 
not  wanted;  of  interest,  have  seen  copy;  of  no  interest;  refer  notice  to  Mr. 

_ ;  and,  modify  my  profile --changes  below.  The  third  quadrant 

contained  space  for  profile  modification,  and  the  fourth  quadrant  contained 
instructions  to  the  user. 

Since  the  response  and  the  notice  cards  were  hand  delivered, 
most  of  the  comments  pertaining  to  profile  changes  were  received  in  con¬ 
versation,  rather  than  in  the  response  cards.  Tlius  the  efficiency  of  this 
block  could  not  be  thoroughly  tested  during  the  pilot  test. 
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TT 

Profile  No. 
Subs.  No. 


Doc.  No. 


INSTRUCTIONS 

Please  complete  this  card  and 
return  it  to  the  lihrary.  Indicate 
your  degree  of  interest,  profile 
modification,  change  of  address. 
You  may  also  refer  this  abstract 
to  others. 

< _ - _ 


USER  RESPONSE 


□  of  interest  -  notify  when  available 
g]  of  interest  ~  document  not  wanted 

□  of  interest  -  have  seen  copy 

□  of  no  interest 

□  refer  notice  to 

□  modify  niy  profile  (changes  below): 


Figure  21  Response  Card 
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Since  the  response  and  the  notice  cards  were  hand  delivered, 
most  of  the  comments  pertaining  to  profile  changes  were  received  in  conver¬ 
sation,  rather  than  in  the  response  cards.  Thus  the  efficiency  of  this  block 
could  not  be  throughly  tested  during  the  pilot  test. 

A  few  participants  showed  a  marked  interest  in  referring  no¬ 
tices  to  fellow  workers  who  were  not  officially  participating  in  the  SDI  pro¬ 
gram.  Relatively  few  referrals  were  made,  however,  and  these  were  made 
by  few  participants.  Comments  most  frequently  checked  were:  of  interest, 
notify  when  available,  of  interest,  document  not  wanted;  and  of  no  interest. 

Particijiants  recorded  the  titles  of  the  journals  which  they 
regularly  read  prior  to  the  first  SDI  cycle.  Although  many  of  the  journals 
processed  during  the  the  pilot  test  were  listed  by  the  participants,  the  com¬ 
ment  "of  interest  --  have  seen  copy"  was  checked  very  infrequently.  It  can 
only  be  conjectured  at  this  time  that  the  information  was  being  delivered 
through  the  SDI  channels  at  a  more  rapid  rate  than  would  normally  be  achieved 
during  regular  reading  of  the  journals  themselves.  It  cannot  be  assumed 
that  the  participants  would  not  have  seen  all  of  the  articles  which  were  of  in¬ 
terest  to  them  as  marked  on  the  response  cards.  The  fact  that  these  journals 
were  out  of  circulation  during  the  processing  may  have  influenced  this  factor 
also. 

2.  Analysis  of  the  Responses  Returned 

Primarily,  resjKinscs  were  analyzed  to  determine  the  effect 
of  varying  the  search  strategy.  Tabulations  of  res|x>nse  versus  strategy 
were  analyzed  to  discover  whether  one  or  the  other  or  a  combination  of 
Boolean  and  tile  weighted  search  strategies  were  more  appropriate  and  yield¬ 
ed  a  higher  relevancy  rate. 

In  addition,  responses  were  analyzed  to  discover  what  corre¬ 
lation  existed  between  relevance  and  (1)  type  of  document,  (2)  number  of  pro¬ 
file  terms,  (3)  number  of  levels  per  profile,  (4)  notice  format,  (5)  job  type, 
and  other  factors. 

3.  Adjustment  of  Interest  Profiles 

Profiles  were  adjusted  after  each  SDI  cycle.  Follow  ing  the 
first  cycle,  terms  used  to  index  documents  selected  and  marked  of  interest 
by  u  |>artici|)unt  were  added  to  his  profile.  In  addition,  each  participant 
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scanned  a  list  of  titles  processed  during  the  first  cycle  and  marked  those 
which  he  thought  might  be  of  interest.  Terms  used  to  index  these  articles 
were  also  added  to  his  profile. 

After  the  second  SDI  cycle,  terms  which  were  thought  to  cause 
a  selection  of  irrelevant  documents  were  removed  from  the  participant's 
profile.  In  general,  the  profiles  were  condensed  in  size  and  the  total  num¬ 
ber  of  profiles  per  participant  was  condensed  in  order  to  speed  up  proces¬ 
sing  time.  An  attempt  was  also  made  to  select  the  most  limited  factor  ex¬ 
hibited  in  each  profile  so  that  searching  time  would  not  be  used  needlessly 
in  matching  frequently  occurring  terms  when  selection  would  be  limited  ul¬ 
timately  by  an  infrequently  occurring  term.  The  grouping  of  terms  into  sets 
was  also  rearranged  so  that  all  terms  in  a  set  would  have  a  similar  relation¬ 
ship  to  each  other. 

After  the  third  dissemination  cycle,  profiles  were  refined  by 
minor  adjustments  in  the  grouping  of  terms  and  the  assignment  of  weights. 


WAIASISOK  PILOT  1EST  RESULTS 


•Judging  from  the  results  of  the  pilot  test  conducted  at  Army  Natick 
Laboratories,  n  is  possible  to  predict  that  a  similar  SDI  system  can  be 
installed  to  supfx>rt  the  technical  personnel  at  other  Army  research  instal¬ 
lations.  Since  the  requirements  met  at  the  NLABS  installation  were  the 
most  stringent  in  terms  of  breadth  and  depth  of  subject  coverage,  an  even 
higher  success  rate  in  terms  of  average  relevancy  of  the  information  dis¬ 
seminated  can  be  predicted  for  other  installations.  In  the  following  analy¬ 
sis,  the  SDI  pilot  test  is  evaluated  in  terms  of  overall  effectiveness,  oper¬ 
ating  costs,  and  effort  necessary  to  implement  a  similar  SDI  system  at 
other  Army  installations.  In  addition,  the  statistics  on  the  four  SDI  pro¬ 
duction  runs  are  analyzed  in  terms  of  the  various  parameters  that  affect 
cost  and  relevancy. 

A.  EVALUATION  OF  EFFECTIVENESS  OF  SDI 
1.  Effect  of  Limitations  on  the  Pilot  Test 

The  most  obvious  limitation  on  the  pilot  test  was  the  selec¬ 
tion  of  a  limited  number  of  participants.  Although  this  limitation  precluded 
the  incorporation  of  the  complete  range  of  interests  existing  at  NLABS,  this 
limitation  did  not  impair  the  validity  of  the  results  obtained.  Interest  pro¬ 
files  of  the  twenty-five  fKirticipants  selected  were  sufficiently  representative 
of  the  total  range  of  interest  so  that  results  can  legitimately  be  extrapolated 
to  those  obtainable  under  full-scale  operation.  The  depth  o'  interest  ex¬ 
isting  in  each  profile1  was  representative  of  the  variations  in  depth  that  might 
be  encountered  in  a  lull-scale  operation.  This  range  in  depth  of  interest 
varied  from  one  participant,  whose  job  was  the  \>  riling  01  specifications,  to 
another  |>articipunt  whose  job  was  the  synthesis  of  new  acetylenic  organo- 
metallic  eom|K>unds  of  Group  IV  and  V  metals.  These  extremes  in  depth  of 
interest  can  be  detected  in  the  statistics  on  profile  structure  presented  in 
Appendix  A.  The  very  broad  interest  is  represented  by  a  profile  containing 
a  large  number  of  terms,  whereas  the  very  specific  interests  of  the  organic 
chemist  are  represented  by  less  than  fifteen  terms. 

Temporal  and  monetary  limitations  on  the  pilot  test  had  more 
significant  effect  on  the  results  obtained  than  did  the  limitations  on  the  num¬ 
ber  of  (xirticipants  selected.  Implicit  in  the1  conduct  of  a  pilot  test  is  the 
stringent  limitation  oti  the  time  available.  In  a  full-scale  implementation 
it  will  lie  possible  to  i>erform  a  wider  variety  of  tests  and  to  obtain  more 
detailed  explanations  of  the  phenomenon  which  occurred  during  the  pilot 
test.  The  substantiation  of  some  of  the  hypotheses  presented  in  this  anal¬ 
ysis  must  await  further  exploration.  Results  of  the  pilot  test  indicate  that 
it  may  be  possible  to  create  a  standard  technique  for  evaluating  all  SDI 
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systems  on  a  normalized  basis.  Some  of  these  analytical  techniques  are 
included  in  this  section  of  the  report  to  indicate  a  topic  for  additional  re¬ 
search. 


The  most  limiting  factor  affecting  the  outcome  of  the  SDI  pi¬ 
lot  test  was  the  lack  of  a  completely  integrated  subject  vocabulary.  The 
use  of  externally- produced  thesauri,  dictionaries,  and  glossaries  provides 
only  a  partial  solution  to  the  larger  vocabulary  problem.  The  results  of 
the  pilot  test  indicate  that  available  term  lists  provide  terms  which  are  too 
narrow  in  scope  for  use  in  selective  dissemination.  The  term  lists  used 
were  structured  in  a  hierarchical  array  at  a  depth  too  low  for  practical 
full-scale  use. 


A  specific  example  will  clarify  this  problem.  A  typical  in¬ 
terest  in  the  pilot  test  was  "high-temperature  properties  of  materials  for 
protection  against  thermal  radiation.  "  Glossary  terms  which  apply  to  this 
interest  are  the  following:  high  temperature  tests;  pyrolysis;  ablation; 
oxidation;  thermal  degradation;  thermal  stability;  melting  points;  thermal 
analysis;  high  temperature;  thermal  properties;  specific  heat;  calorimetry; 
infrared  radiation;  nuclear  blast  effects;  thermal  energy;  plastics;  poly¬ 
mers;  fibers  (synthetic);  fibers  (natural);  etc.  Since  the  indexer  has  such 
a  wide  variety  of  terms  from  which  he  can  select  a  subset  to  describe  a 
particular  document,  each  profile  must  contain  a  comparable  number  of 
terms  to  increase  the  probability  that  a  profile  term  will  match  a  document 
term.  However,  this  strategy  vastly  increases  the  number  of  irrelevant 
matches  since  the  total  number  of  term  combinations  that  might  cause  se¬ 
lections  is  proportional  to  a  factorial  function  of  the  number  of  terras  in 
the  subscriber's  profile.  What  is  needed  to  increase  the  efficiency  of  the 
matching  process  and  the  relevancy  of  information  distributed  is  a  com¬ 
plete  and  thorough  analysis  and  structuring  of  a  vocabulary  according  to 
the  specific  requirements  of  the  users  of  the  system.  This  endeavor  is 
properly  the  subject  of  a  separate  high-level  endeavor  utilizing  specialists 
knowledgeable  in  each  of  the  areas  of  interest  to  the  participants. 

The  creation  of  a  comprehensive  and  coherent  vocabulary 
would  not  only  increase  the  efficiency  of  the  selection  process  and  the  rel¬ 
evancy  of  disseminated  information,  but  would  also  s|>oed  up  the  document 
indexing  and  subscriber  profiling  processes.  If  terms  were  imbedded  in 
a  complete  hierarchy,  the  indexer  could  select  terms  down  to  the  exact 
level  required.  The  length  of  time  required  for  indexing  is  roughly  pro¬ 
portional  to  the  number  of  terms  that  must  be  selected  to  represent  the  sub¬ 
ject  content  of  a  document.  Using  pre-existing  vocabularies  arrayed  in  a 
two-ievel  hierarchy  required  the  indexer  to  select  an  inordinately  large 
number  of  specific  terms  to  represent  a  general  topic.  Correspondingly, 
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the  number  of  terms  used  in  describing  an  interest  profile  is  also  inordin¬ 
ately  large. 


Since  available  thesauri  do  not  utilize  the  inherent  structure 
existing  in  a  particular  subject  discipline,  problems  arise  in  both  subject 
indexing  of  documents  and  subscriber  interests.  Although  the  effort  in¬ 
volved  in  creating  a  comprehensive  vocabulary  is  a  large  initial  expendi¬ 
ture,  significant  savings  in  daily  expenditures  in  indexing  would  accrue  from 
such  an  effort. 

2.  Effectiveness  of  the  SDI  Notice  as  a  Self-Sufficient  Information 
Carrier 


In  this  pilot  test,  the  content  of  the  SDI  notice  was  obtained 
by  duplicating  selected  (portions  of  the  article  itself.  Both  the  front  and  re¬ 
verse  sides  of  the  notice  could  be  utilized  so  that  the  information  dissemi¬ 
nated  could  occupy  an  area  equivalent  to  that  of  two  KAM-sized  cards.  Since 
information  was  disseminated  only  as  it  existed  in  the  source  document,  an 
evaluation  of  the  notice  is  really  an  evaluation  of  the  quality  of  the  extract. 

The  majority  of  source  materials  consisted  of  two  types  of 
journal  articles,  those  appearing  .  trade  journals,  and  those  appearing  in 
scholarly  technical  journals.  In  many  of  the  technical  journals,  a  concise 
abstract  heads  each  article.  In  most  trade  journals  the  essence  of  an  ar¬ 
ticle  existed  only  in  an  introductory  paragraph  or  one  buried  in  the  text.  The 
significant  passages  within  each  document  were  marked  by  the  subject  in¬ 
dexer  for  inclusion  in  the  SDI  notice.  For  technical  reports,  abstracts  were 
extracted  from  the  Technical  Abstract  Bulletin,  published  by  Defense  Doc¬ 
umentation  Center,  and  Scientific  and  Technical  Aerospace  Reports,  pub¬ 
lished  by  National  Aeronautics  and  S|»ace  Administration. 

Almost  all  the  information  disseminated  by  this  means  was 
sufficient  to  indicate  the  subject  content  of  the  document  or  article.  On  the* 
basis  of  reading  the  notice,  purticiiuuits  could  decide,  in  most  eases,  wheth¬ 
er  or  not  they  would  like  to  read  the  complete  article. 

In  all  cases.  |Hirticipants  retained  the  SDI  notice  for  future 
reference.  Retention  of  the  abstract  indicates  that  the  notice  is  a  self- 
sufficient  information  carrier  of  some  permanent  value.  Abstracts  re¬ 
ceived  most  enthusiastically  were  of  the  informative  type  in  which  research 
results  were  rc|x>rtcd  in  brief  as  well  as  an  indication  of  the  document  con¬ 
tent.  Abstracts  from  technical  journals  were  received  least  enthusiasti¬ 
cally  because  of  low  information  content.  Some  purti'-ijMiits  remarked  that 
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trade*  journals  included  u  high  pro|*>rtion  of  promotional  mail  rial  printed  in 
the  format  of  a  technical  article. 

3.  Effect  of  the  SP1  Notice  on  Influencing  Information 

Acquisition  Habits 

In  many  instances,  participants  referred  a  notice  to  some¬ 
one  else.  The  direct  transfer  of  information  between  the  individuals  might 
play  a  larger  role  in  a  future  SDI  system.  In  essence,  the  original  recip¬ 
ient  of  a  notice  would  act  as  a  further  switching  | joint  in  channeling  informa¬ 
tion  to  the  right  person.  This  phenomenon  occurred  most  frequently  for  in¬ 
dividuals  working  within  the  same  branch  or  laboratory.  However,  some 
participants  referred  notices  to  individuals  outside  their  own  sphere  of  in¬ 
terest,  which  indicates  an  effective  system  of  personal  communication  with¬ 
in  Army  Natick  Laboratories. 

Over  the  short  period  during  the  pilot  test,  participants  still 
relied  heavily  on  former  sources  of  information.  Also,  participants  were 
well  aware  of  the  temporary  nature  of  the  pilot  test  and  thus  did  not  plan  to 
rely  substantially  on  it.  Since  photocopies  of  journal  articles  and  technical 
reports  were  not  distributed  automatically  through  SDI  feedback,  partici¬ 
pants  merely  retained  the  notice  for  reference  to  the  future  use  of  the  ar¬ 
ticles.  The  evaluation  of  the  effect  of  SDI  on  personal  information  acquisi¬ 
tion  habits  must  await  full-scale  implementation  in  which  provision  will  be 
made  for  automatic  distribution  of  photocopied  articles.  An  analysis  could 
then  be  made  of  the  relevancy  of  articles  selected  on  the  ktsis  of  the  ab¬ 
stract.  Other  long-term  studies  that  might  lx*  conducted  are  the  effect  on 
utilisation  of  library  information  resources  and  the  effect  on  interdisci¬ 
plinary  activities.  , 

B.  OPERATING  COSTS 

Operating  costs  are  analyzed  in  terms  of  materials,  equipment  uti¬ 
lization,  and  personnel  requirements.  The  total  cost  of  operating  the  pilot 
SDI  system  is  further  analyzed  into  the  unit  cost  per  document  processed, 
the  unit  cost  per  profile,  and  the  unit  cost  for  producing  one  SDI  notice. 

1.  Raw  Materials 


The  cost  of  raw  materials  is  negligible  compared  witn  other 
unit  costs  of  the  system.  The  five  materials  used  are  8  1/2  x  14"  paper, 

8  1/2x11"  paper.  Xerox  card  stock,  EAM  cards  and  rubber  cement.  The 
following  table  lists  unit  costs. 
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Pu|ier  (»  1/2  x  14") 
Paper  (K  1/2  x  11'} 
Xerox  Curd  Stock 
EAM  Cards 
.  Rubber  Cement 


$20.00  per  carton  (a,  000  sheets) 
$16,115  per  carton  (5,000  sheets) 
$75.  00  per  case  (4,  000  cards) 
$40.  50  per  carton  (10,000  cards) 
$  .  45  per  bottle 


The  Xerox  card  stock,  purchased  from  Essex  Systems,  Inc. 
in  New  York  City,  is  die  cut  sheets,  each  containing  four  EAA1  cards.  The 
cards  will  |>ass  through  any  low  speed  electronic  accounting  machine  equip¬ 
ment. 


2.  Equipment  Utilization 

The  highest  single  data  processing  cost  was  associated  with 
computer  usage.  Approximately  one  minute  is  used  in  matching  the  interest 
profiles  for  twenty-five  participants  against  the  terms  describing  one  docu¬ 
ment.  Total  time  is  proportional  to  the  product  of  the  total  number  of  docu¬ 
ment  terms  and  the  total  number  of  profile  terms.  For  the  fourth  production 
cycle,  235  documents  were  processed  in  three  hours. 

The  following  is  a  table  giving  equipment  utilization  per  work 

unit: 


Keypunch 
Card  Sorter 
Card  Sorter 
Card  Collator 
Card  Reproducer 
Interpreter 


3.  50  minutes  per  document 
.06  minutes  |>er  document 
.01  minutes  |*er  selection 
.01  minutes  fier  selection 
.01  minutes  |>er  selection 
.  02  minutes  per  selection 


A  Xerox  office  copier  was  used  to  produce  notice  cards.  In 
this  operation.  Xeroxing  title  pages  and  Tables  of  Contents  required  1. 54 
minutes  per  document  and  Xeroxing  notice  cards  required  32  minutes  |>er 
selection. 


3.  Personnel  Requirements 

One  trained  subject  indexer  can  index  approximately  thirty 
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documents  per  day  if  the  subject  terms  are  dictated  into  a  recording  ma¬ 
chine.  If  the  indexer  is  required  to  write  out  the  terms  on  an  indexing 
worksheet,  the  task  can  be  accomplished  at  the  rate  of  twenty  documents 
per  day.  Including  Xeroxing  of  notice  cards,  but  excluding  the  computer 
matching  process,  approximately  1,000  notices  could  be  processed  in  an 
eight-hour  day  by  one  person.  During  an  eight-hour  day,  one  clerk-typist 
could  process  approximately  twenty-five  documents,  including  the  Xeroxing 
and  stripping-up  of  title  pages. 

4.  Cost  Centers 


For  purposes  of  costing  out  the  operation  of  the  SDI  system, 
three  cost  centers  were  developed.  Document  processing  includes  all  those 
costs  attributable  to  making  the  document  available  to  the  matching  and  se¬ 
lection  function,  and  to  notice  duplication.  Distribution  processing  includes 
all  costs  associated  with  matching,  selecting,  and  processing  selections. 
Profile  costs  are  those  associated  with  indexing  the  interests  of  subscribers 
and  in  preparing  profiles  in  machine -readable  form.  Tables  2  and  3  give 
the  costs  for  document  and  distribution  processing.  The  third  cost  center, 
associated  with  profiling,  can  vary  over  a  wide  range  and  is  very  depen¬ 
dent  upon  the  indexer's  understanding  of  the  subject  matter  and  compre¬ 
hension  of  the  problems  described  by  the  participants.  Typical  unit  costs 
cannot  be  extrapolated  to  other  programs  because  of  varying  circumstances. 
During  the  course  of  the  pilot  test,  approximately  thirty  man-days  were 
used  in  profiling  activities.  Six  man-days  were  consumed  in  personal  and 
group  interviews. 

Compared  with  the  time  required  to  punch  document  subject 
cards  and  descriptive  cataloging  cards,  keypunch  requirements  for  pro¬ 
files  are  minor.  The  number  of  cards  required  for  each  profile  ranged  from 
ten  to  over  one  hundred.  Since  the  number  of  cards  requiring  re- keypunch¬ 
ing  due  profile  changes  is  relatively  small,  the  largest  keypunching  re¬ 
quirement  is  met  during  construction  of  the  initial  profile. 

5.  Proration  of  Costs  Between  Document  Services 

In  a  fully  integrated  system,  the  cost  of  document  proces¬ 
sing  would  be  shared  among  other  services.  The  same  cards  punched  for 
the  SDI  system  could  also  be  used  for  generating  accession  lists,  catalog 
cards,  book-form  indexes,  and  machine -searchable  files.  The  costs  in¬ 
curred  in  subscriber  profiling  and  document  selection  would  be  the  only 
ones  attributable  directly  to  an  SDI  system. 
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TABLE  2 

DOCUMENT  PROCESSING 


Task 

! - - - 

Minutes  Per 
Document 

Log  in  journal 

.  01 

Select  documents  ar.d  mark  extracts 

4.44 

Type  journal  citation 

.  30 

Xerox  journal  table  of  contents  and 
document  title  page 

1.  54 

Assign  and  stamp  accession  .iumt.  r 
and  complete  descriptive  catalogir 
worksheet 

1.54 

Keypunch  descriptive  cataloging 
worksheet 

3.00 

Proof  and  correct  title  and  journal 
citation 

.  27 

Sort  descriptive  cataloging  cards  by 
accession  number 

.02 

List  descriptive  cataloging  cards 

.01 

Perform  subject  indexing 

17.40 

Keypunch  subject  terms 

1. 50 

Proof  and  correct  subject  terms 

.  46 

Sort  subject  terms  by  accession  number 

.  08 

Log  out  journal 

.  01 

Cut  document  title  page  and  strip-up 

7.00 

Total  time 

37.  58  minutes 

TABLE  3 

DISTRIBUTION  PROCESSING 

Minutes  Per 


Tasks  Selection 


Match  profile  terms  against 
document  terms  .  65 

Sort  copy  at  distribution  cards  .  01 

List  copy  cards  .  01 

Xerox  notice  cards  .  32 

Burst  notice  cards  .  20 

Collate  notice  and  copy  cards  .  04 

Intersperse  gang  punch  notice  cards  .  01 

Sort  notice  cards  .  01 

Duplicate  distribution  cards  into 
response  cards  .01 

Merge  response  and  notice  cards  .  01 

Intersperse  gang  punch  notice  cards  .  01 

Interpret  response  and  notice  cards  .  02 

Sort  distribution  and  notice  cards  .01 


Total  time 


1.31  minutes 


c. 


IMPLEMENTATION  AT  OTHER  ARMY  R&D  FACILITIES 


1.  Adaptation  of  Computer  Programs 

Computer  programs  written  for  the  pilot  SDI  system  can  be 
used  with  minor  modifications  at  other  installations.  Three  computer  pro¬ 
grams  are  available  lor  maintaining  a  glossary  of  technical  terms,  record¬ 
ing  document  and  profile  terms  on  magnetic  tapes,  and  matching  the  two 
resulting  tapes  to  obtain  SDI  selections.  Whether  it  is  desirable  to  use 
these  programs  depends  upon  the  current  degree  of  mechanization  of  the 
adopting  library.  Highly-mechanized  libraries  will  probably  prefer  to 
rewrite  the  programs  so  that  the  SDI  system  is  closely  integrated  with 
other  presently  mechanized  library  services.  On  the  other  hand,  libraries 
not  currently  mechanized  will  probably  want  to  utilize  these  programs  as 
a  starting  point.  Since  the  objectives  of  the  pilot  test  were  broader  than 
that  of  merely  developing  computer  programs,  users  may  also  wish  to 
modify  the  programs  to  increase  their  operating  efficiency  and  thereby  re¬ 
duce  the  total  computer  utilization  time. 

2.  System  Modification  to  Meet  Local  Requirements 

The  method  of  producing  the  SDI  notice  card  described  in 
this  report  is  recommended  for  three  situations:  a)  ready  availability  of 
pre- abstracted  materials,  b)  large  number  of  copies  (i.  e. ,  over  ten)  re¬ 
quired  for  each  document,  and  c)  requirement  to  preserve  information 
which  cannot  be  keypunched.  Any  one  or  combination  of  these  three  cir¬ 
cumstances  would  warrant  the  production  of  SDI  notice  cards  by  the  method 
described  in  this  report.  In  circumstances  other  than  these,  a  convention¬ 
al  SDI  system  utilizing  the  on-line  computer  printer  to  produce  notice  im¬ 
ages  is  more  appropriate. 

3.  Development  of  a  Technical  Vocabulary 

The  degree  of  efficiency  of  the  SDI  matching  function  is  di¬ 
rectly  proportional  to  the  quality  of  the  subject  indexing  tool  and  its  appli¬ 
cation  in  the  indexing  of  both  subscriber  profiles  and  documents.  In  the 
circumstances  encountered  at  NLABS,  a  much  higher  efficiency  could  have 
been  attained  had  an  appropriate  indexing  tool  been  available.  At  some  in¬ 
stallations,  it  will  be  possible  to  utilize  pre-existing  externally  produced 
thesauri.  The  adoption  of  this  technique,  however,  should  be  tested  on  a 
small  scale  so  that  experience  can  be  gained  to  gage  the  suitability  of  this 
approach. 


4. 


Personnel  Training 


Very  little  training  is  required  to  implement  and  operate 
this  SDI  system.  Assuming  that  librarians  and  data  processing  personnel 
will  be  responsible  for  carrying  out  the  major  processing  operations,  these 
personnel  can  learn  their  respective  roles  through  reading  the  implementa¬ 
tion  manual  which  constitutes  Volume  II  of  this  report.  No  specialized 
skills  are  required  other  than  those  normally  encountered  in  document  in¬ 
dexing  and  data  processing.  Clerk-typists  can  assume  the  bulk  of  work 
load  involved  in  notice  production.  Clerical  operations  are  restricted  to 
typing.  Xeroxing,  cutting  and  stripping.  If  skilled  technical  typists  are 
available,  they  can  be  utilized  to  assume  some  of  the  document  indexing 
work  load  by  transcribing  technical  terms  dictated  by  the  subject  indexer. 

In  most  installations,  programs  or  plugboards  are  readily 
available  to  accomplish  the  interpreting,  collating,  and  reproducing  func¬ 
tions. 


5.  Inter  laboratory  Cooperation 

As  mentioned  above,  one  of  the  conditions  under  which  the 
proposed  method  of  notice  production  is  recommended  is  that  of  the  avail¬ 
ability  of  pre-existing  abstracts.  One  of  the  most  attractive  features  of 
this  method  is  the  universality  of  the  graphic  image.  Data  in  machine- 
readable  form  usually  must  be  converted  before  utilization  in  another  sys¬ 
tem.  Systems  utilizing  graphic  reproduction  of  SDI  notices,  however,  can 
share  a  common  file  of  original  printed  information.  Any  image  printed 
on  an  EAM  card  can  be  utilized  with  equal  facility  by  all  installations. 

Although  information  contained  in  an  abstract  may  be  shared 
by  different  systems,  the  approach  to  subject  indi"  ing  may  differ  signifi¬ 
cantly  so  that  utilization  of  machine -readable  index  terms  may  not  be  pos¬ 
sible.  However,  should  the  sharing  of  machine- readable  data  be  intellec¬ 
tually  feasible,  the  format  for  machine -readable  processing  in  this  system 
has  teen  simplified  so  that  the  sharing  of  machine- readable  data  can  be 
accomplished  with  minimal  file  conversion  problems.  Each  document  term 
is  contained  within  a  single  record  on  the  magnetic  tape.  This  record  con¬ 
tains  the  characters  representing  the  term,  the  number  of  the  subject  in¬ 
dexing  tool  from  which  the  term  was  extracted,  and  the  document  number. 

D.  INTERPRETATION  OF  PRODUCTION  STATISTICS 


Statistics  on  the  four  SDI  production  runs  are  presented  in  Ap[x.*n- 
dixes  A  and  H.  Appendix  A  presents  the  statistics  necessary  to  derive 


the  relationship  between  relevancy  of  selections  and  various  parameters 
such  as  profile  type,  number  of  terms,  and  term  source.  Appendix  B  pre¬ 
sents  the  distribution  of  the  number  of  selections  for  each  document. 


1.  Relevancy  vs.  Size  of  Profile 

There  does  not  seem  to  be  a  strong  relationship  between  the 
number  of  terms  in  a  profile  and  the  degree  of  relevancy  of  selections.  One 
relationship  that  does  hold  is  that  75%  of  the  profiles  containing  less  than  20 
terms  are  50%  relevant  or  better.  The  average  percent  of  profiles  achieving 
relevancy  greater  than  50%  is  respectively  64%,  30%,  48%,  and  52%  for  the 
four  production  runs. 

2.  Relevancy  vs.  Glossary 

Profiles  indexed  by  the  ITT  Thesaurus  yield  the  most  rele¬ 
vant  selections  whereas  those  profiled  by  the  EJC  Thesaurus  yield  the  least 
relevant  documents.  The  validity  of  this  conclusion  is  supported  intuitively 
since  it  could  be  expected  that,  for  NLABS  interests,  most  relevant  terms 
would  be  obtained  from  a  textile  glossary,  *  .jereas  least  relevant  terms 
would  be  obtained  from  a  general  engineering  glossary. 

3.  Relevancy  vs.  Type  of  Profile 

A  true  relationship  between  the  value  of  the  weighted  profile 
versus  the  Boolean  profile  cannot  be  determined  from  the  test  results  be¬ 
cause  weighted  profiles  were  used  especially  in  difficult  cases  where  the 
predicted  number  of  matches  was  low.  In  the  last  production  run,  however, 
the  average  relevancy  for  weighted  profiles  was  39%  and  accounted  for  50% 
of  the  0%  relevancy  profiles  (2  out  of  4).  Deleting  weighted  profile  relevancies 
raises  the  average  relevancy  for  the  remaining  profiles  from  45%  to  47%. 

4.  Man  vs.  Machine 

On  the  last  production  cycle,  an  experiment  was  conducted  to 
determine  whether  manual  selection  or  computer  selection  yielded  the  high¬ 
est  relevancy  ratios.  The  results  of  this  experiment  are  summarized  in  the 
Table  4  entitled  "  Statistics  of  Manual  Versus  Computer  Selection."  The  last 
two  columns  of  this  table  interpret  the  meaning  of  the  statistics  presented. 

The  column  entitled  "Which  Method  Missed  More  Relevant  Documents, " 

8 hows  that  a  person  selecting  on  the  basis  of  document  title  alone  misses 
more  relevant  documents  than  the  computer  selection  process  based  <<n  sub¬ 
ject  terms  assigned  by  that  same  person.  The  second  column  entitled  "Which 
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TABLE  4 


STATISTICS  OF  MANUAL  VS.  COMPUTER  SELECTION 


Profile 

No. 

Matches 
( Manual) 

Of  Interest 
( Manual) 

%  of  Interest 
( Manual) 

Matches 
{ Machine) 

Of  Interest 
( Machine) 

%  of  Interest 
(Machine) 

Which  Method 
Missed  More  Rel¬ 
evant  Documents? 

Which  Method 
Added  More  Irrel¬ 
evant  Documents? 

1 

9 

■ 

44 

12 

8 

67 

Man. 

- 

2 

50 

3 

6 

41 

H 

BB 

Bgfijf 

Man. 

3 

22 

3 

14 

20 

■a 

35 

Man. 

Man. 

4 

20 

10 

50 

31 

n 

35 

Man. 

Comp. 

5 

17 

15 

88 

24 

18 

75 

Man. 

Comp. 

6 

12 

11 

91 

29 

21 

72 

Man. 

Comp. 

7 

9 

1 

11 

4 

3 

75 

Man. 

Man. 

8 

2 

1 

50 

2 

1 

50 

- 

- 

9 

1 

1 

100 

2 

0 

0 

Comp. 

Comp. 

10 

8 

m 

50 

1 

0 

0 

Comp. 

Man. 

11 

8 

3 

38 

27 

8 

29 

Man. 

Comp. 

12 

26 

26 

100 

16 

16 

100 

Comp. 

- 

16 

0 

- 

- 

■1 

0 

0 

Comp. 

17 

28 

■1 

14 

36 

6 

17 

Man. 

Comp. 

21 

10 

6 

60 

12 

11 

92 

Man. 

Man. 

22 

16 

11 

69 

4 

■a 

75 

Comp. 

Man. 

23 

9 

■a 

78 

12 

■a 

58 

- 

Comp. 

24 

8 

m 

50 

14 

8 

57 

Man. 

Comp. 

25 

■B 

- 

- 

■a 

- 

B 

- 

- 

26 

m 

3 

75 

■a 

MM 

57 

Man. 

Comp. 

28 

o 

- 

- 

10 

■a 

50 

Man. 

Comp. 

mem 

Hi 

ir> 

33 

68 

19 

28 

Man. 

Comp. 

.»o 

0 

- 

- 

1 

0 

0 

- 

Comp. 

;ti 

0 

- 

- 

0 

- 

- 

- 

- 

32 

10 

8 

80 

13 

6 

46 

Comp. 

Comp. 
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Method  Added  More  Irrelevant  Documents, "  shows  that  the  computer  tends 
to  select  more  irrelevant  documents  than  the  person  selecting  on  the  basis 
of  title.  This  experiment  indicates  that  the  overall  relevancy  of  an  SDI  sys¬ 
tem  could  be  much  enhanced  by  human  pre-screening  of  the  computer  se¬ 
lected  SDI  notices.  Average  relevancy  for  manual  selection  was  55%  as  com¬ 
pared  to  45%  for  computer  selection.  The  difference  in  relevancy,  however, 
is  attributable  to  the  larger  number  of  irrelevant  documents  selected  by  the 
computer. 


The  fact  that  computer  selection  also  misses  relevant  docu¬ 
ments  is  supported  by  the  last  column  of  statistics  on  the  first  SDI  cycle 
presented  in  Appendix  A.  This  column  labeled  "Recall"  shows  the  percent¬ 
age  of  relevant  documents  missed,  based  on  an  estimation  performed  by  each 
participant  who  scanned  a  total  list  of  titles  processed  in  the  first  SDI  cycle. 

5.  Summary  of  Relevancy  Statistics 

Table  5  summarizes  the  relevancy  statistics  derived  from 
Appendix  A.  This  table  shows  that,  for  instance,  the  percentage  of  profiles 
achieving  50%  relevancy  or  better,  was  64%  for  Run  I,  30%  for  Run  II,  48% 
for  Run  III,  and  52%  for  Run  IV.  The  low  relevancy  for  the  second  run  was 
due  to  the  addition  of  terms  used  to  index  documents  judged  relevant  that 
were  selected  during  Run  I.  This  extreme  dip  in  relevancy  shows  that  terms 
from  relevant  documents  cannot  be  used  as  a  basis  for  profile  readjustment. 

Table  6  summarizes  relevancy  and  number  of  relevant  matches 
for  each  subscriber.  These  figures  are  an  extraction  from  Appendix  A. 

6.  Document  Selection  Pattern 


Appendix  B  shows  the  distribution  of  the  number  of  times 
documents  were  selected  in  each  of  the  production  cycles.  These  statistics 
can  be  used  to  plan  reproduction  processing  since  they  show  how  many  docu¬ 
ments  were  selected  once,  tw<ce,  three  times,  etc.  A  significant  factor  is 
the  number  of  documents  selected  0  tie  ms,  since  this  indicates  the  relative 
match  of  the  document  collection  to  the  needs  of  the  subscribers.  For  the 
four  production  cycles,  the  numbers  of  documents  selected  0  times  were 
respectively  15  out  of  1 1 1  processed,  186  out  of  418  processed,  86  out  of  247 
processed,  and  88  out  of  235  processed.  Those  documents  selected  two  or 
three  times  accounted  for  the  highest  percentage  of  notices. 
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TABLE  5 

PERCENT  RELEVANCY  VS.  NUMBER  OF 
RELEVANT  MATCHES 


Profile 

No. 

Run 

I 

Run 

II 

Run 

III 

Run 

IV 

1 

50/7 

29/2 

40/4 

67/8 

2 

36/9 

8/5 

22/8 

7/3 

3 

50/ 2 

27/20 

21/6 

35/7 

4 

53/10 

21/16 

30/12 

35/11 

5 

47/7 

41/17 

83/20 

75/18 

6 

83/15 

56/44 

50/14 

72/21 

7 

50/1 

0/0 

100/1 

75/3 

8 

-/- 

41/25 

0/0 

50/1 

9 

25/9 

0/0 

0/0 

0/0 

10 

71/15 

42/ 13 

60/3 

0/0 

11 

10/2 

4/1 

21/7 

29/8 

12 

20/7 

87/46 

100/7 

100/16 

16 

50/1 

40/2 

0/0 

0/0 

17 

42/8 

12/8 

10/4 

17/6 

21 

71/17 

72/39 

73/11 

92/11 

22 

30/3 

90/9 

0/0 

75/3 

23 

69/9 

56/23 

50/2 

58/7 

24 

70/14 

-/- 

57/12 

57/8 

25 

-/- 

55/6 

-/- 

-/- 

26 

64/9 

31/13 

51/5 

57/4 

28 

100/3 

21/6 

60/3 

50/5 

29 

66/25 

37/23 

33/22 

28/19 

30 

33/1 

60/3 

100/1 

20/0 

31 

-/- 

-/- 

32 

90/4 

33/2 

18/2 

46/6 
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TABLE  6 

SUMMARY  OF  RELEVANCY  STATISTICS 


Average  relevancy 

Run 

I 

54 

Average  of  profiles  achieving 

over  10%  relevancy 

56 

Percent  profiles  achieving 

relevancy  greater  than  or 

equal  to;  90% 

9 

80% 

14 

70% 

27 

60% 

41 

50% 

64 

40% 

73 

30% 

86 

20% 

95 

10% 

100 

0% 

100 

Run 

Run 

III 

IV 

43 

45 

54 

57 

13 

9 

17 

9 

22 

26 

30 

30 

48 

52 

52 

57 

61 

65 

74 

74 

83 

78 

100 

100 

VI.  RECOMMENDATIONS  AND  CONCLUSIONS 


A.  RECOMMENDATIONS 

1.  Before  implementation  of  a  full-scale  SDI  system,  each  in¬ 
stallation  should  carefully  analyze  its  subject  indexing  requirements.  If  a 
controlled  vocabulary  does  not  already  exist,  existing  vocabularies  can  be 
profitably  tested  using  the  model  described  in  this  pilot  test. 

2.  Three  criteria  should  be  used  in  determining  SDI  reproduction 
requirements.  These  criteria  are:  (a)  need  to  reproduce  formulae  and  other 
graphic  representations,  (b)  medium  to  high  notice  copy  requirements,  (c) 
availability  of  pre-existing  abstracts. 

3.  An  SDI  system  should  be  integrated  with  other  mechanized 
information  services  so  that  multiple  use  can  be  made  of  document  data 
currently  put  in  machine -readable  form. 

4.  An  SDI  system  should  be  implemented  in  stages  to  acquaint 
the  users  and  the  operators  with  the  methods  and  procedures  employed  and 
to  uncover  special  problems  that  may  exist. 

5.  Full-scale  implementation  should  utilize  inverted  files  to  re¬ 
duce  computer  matching  time. 

6.  Indexers  should  be  employed  who  thoroughly  understand  the 
intricacies  of  the  <  object  being  indexed. 

7.  Maximum  use  should  be  made  of  document  data  already  in 
machine -readable  form  from  external  sources. 

8.  This  model  system  slmuld  continue  to  be  used  as  a  research 
tool  in  discovering  additional  relationships  that  may  exist  between  SDI  param¬ 
eters. 


B.  CONCLUSIONS 

1.  A  potential  exists  for  deriving  highly  relevant  profiles  con¬ 
structed  by  a  combination  of  the  Boolean  and  Weighted  match  strategies. 

2.  When  subject  interests  are  diverse  and,  at  the  same  time,  in- 
depth,  a  hierarchical  structuring  of  vocabulary  terms  is  necessary  to  achieve 
high  relevancy  si  an  efficient  level. 

3.  Matching  on  inverted  files  yields  higher  processing  efficiency. 
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4.  Clcse  contact  with  the  system's  users  is  extremely  neces¬ 
sary,  both  from  the  standpoint  of  communications  and  personal  relations. 

5.  There  is  :n  inverse  relationship  between  the  number  of  se¬ 
lections  made  by  the  computer  and  the  percentage  of  relevant  selections. 

6.  A  strong  relationship  does  not  exist  between  the  number  of 
terms  in  the  profile  and  the  rele  anc’  cf  selections. 

7.  Pre-existmg  abstracts  for  technical  journals  and  Government 
reports  are  sufficiently  informative  to  be  used  on  the  SDI  notice,  without 
modification. 
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DOCUMENT  SELECTION  PATTERN  FOR  FIRST  PRODUCTION  CYCLE 


15 

Documents  were  selected 

0 

times. 

giving 

0 

notices 

19 

II 

II  it 

1 

II 

II 

19 

II 

13 

If 

It  II 

2 

II 

II 

26 

II 

29 

It 

M  II 

3 

II 

II 

87 

II 

8 

tf 

II  II 

4 

It 

II 

32 

II 

12 

ft 

II  II 

5 

II 

II 

60 

II 

4 

•f 

II  II 

6 

II 

It 

24 

II 

8 

II 

II  II 

7 

II 

It 

56 

II 

3 

II 

II  II 

8 

II 

II 

24 

II 

111 

Documents  were  processed  giving 

328 

notices 

Notices 

%  Total  Cumulative  % 

Documents 

%  Total 

Cumulative  % 

87 

26.5 

26.5 

29 

26.  1 

26.  1 

60 

18.3 

44.8 

12 

10.  8 

36.  9 

56 

17. 1 

6!. 9 

8 

7.2 

44.  1 

32 

9.8 

71.7 

8 

7.2 

51.  3 

26 

7.9 

79.6 

13 

11.7 

63.0 

24 

7.3 

86.9 

4 

3.6 

66.  6 

24 

7.3 

94.2 

3 

2.7 

69.3 

19 

5.8 

100.0 

19 

17. 1 

85.4 

0 

0 

100.0 

15 

13.  5 

99.9 

328 

100.0 

ill 

99.  9 
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DOCUMENT  SELECTION  PATTERN  FOR  SECOND  PRODUCTION  CYCLE 
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II 

II 

63 
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tf 
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10 

tl 
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10 

6 

tl 

M  M 

11 

II 

II 

66 

1 

It 

•1  II 

12 

II 

II 

36 

4 

II 

II  II 

13 

II 

it 

52 

6 

II 

II  II 

14 

II 

»! 

84 

0 

II 

II  II 

15 

II 

♦  1 

0 

1 

II 

II  II 

16 

♦  1 

II 

16 

418 

Documents  were  processed  giving 

886 

notices 


Notices 

%  Total 

Cumulative  % 

Documents 

Total 

Cumulative  % 

108 

12.2 

12.2 

36 

8.  6 

8.  6 

88 

9.9 

22.  1 

44 

10.  5 

19.  1 

84 

9.  5 

31.6 

6 

1.  4 

20.  5 

78 

8.6 

40.2 

19 

4.5 

25.0 

71 

8.0 

48.2 

71 

17.0 

42.0 

66 

7.  5 

55.7 

6 

1.  4 

43.4 

63 

7.  1 

62.8 

7 

1.  7 

45.  1 

56 

6.  3 

69.  1 

8 

1.9 

47.  0 

56 

6.  3 

75.4 

7 

1.7 

48.7 

54 

6.  1 

81.  5 

9 

2.2 

50.  9 

52 

5.9 

87.4 

4 

1.0 

51.  9 

SO 

5.6 

93.0 

10 

2.  4 

54.  3 

36 

4.  1 

97.  1 

3 

0.  7 

55.  0 

16 

1.  8 

98.9 

1 

0.2 

55.2 

10 

1.  1 

100.0 

1 

0.2 

55.  4 

0 

0 

100.0 

186 

44.  5 

99.  9 

8  86 

100.  0 

418 
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DOCUMENT  SELECTION  PATTERN  FOR  THIRD  PRODUCTION  CYCLE 
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DOCUMENT  SELECTION  PATTERN  FOR  FOURTH  PRODUCTION  CYCLE 
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13  abstract  This  report  presents  the  results  of  a  nine-month  test  of  a  prototype 
SDI  system  developed  for  Army  Technical  Libraries.  During  the  pilot  test,  one 
thousand  documents  were  cataloged  and  indexed  and  disseminated  to  twenty-five 
scientific  and  technical  personnel  at  U.  S.  Army  Natick  Laboratories.  During 
the  course  of  the  pilot  test,  statistics  were  accumulated  on  operating  costs  and 
various  parameters  affecting  the  relevancy  of  disseminated  information.  The 
prototype  SDI  system  utilizes  graphic  techniques  for  producing  multiple  copies  of 
each  document  abstract.  The  output  of  the  computerized  matching  function  is  a 
set  of  EAM  cards  which  are  subsequently  used  to  collate  reproduced  abstracts 
for  dissemination.  Collation  is  accomplished  by  interspersed  gangpunching, 
merging,  and  sorting  techniques. 
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